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SYSTEMATIC POLYPEPTIDE -EVOLUTION. 
BY REVERSE TRANSLATION 



f TF^ P OF THE INVENTION 

We describe herein novel high-affinity 
polypeptide ligands^ that specif ically' bind a' desired 
Wrget*^ represented for Electing a 

polypeptide ligand that specifically binds any desired 
target molecule. The method is termed SPERT, an 
acronym for Systematic Polypeptide Evolution by Reverse 
Translation- The method of the invention (SPERT) is 
useful to isolate a polypeptide ligand for a desired 
target molecule. The polypeptide products of the 
invention are useful for any purpose to which a binding 
reaction may be put, for example in assay methods, 
diagnostic procedures, cell sorting, as inhibitors of 
target molecule function, as probes, as sequestering 
agents and the like. In addition, polypeptide products 
of the invention can have catalytic activity. Target 
molecules include natural and synthetic polymers,' 
including proteins, polysaccharides, glycoproteins, 
hormones, receptors and cell surfaces, nucleic acids, 
and small molecules such as drugs, metabolites, 
cof actors, transition state analogs and toxins. 

RACKGROUND OF THE INVENTION 

As translation of mRNA proceeds, stable 
complexes are formed. These complexes are made of 
ribosomes bound to mRNA with tRNA and nascent 
polypeptide encoded by the messenger RNA. Termed 
"ribosome complexes" herein, such complexes can be 
isolated by various known processes (Connolly and 
Gilmore (1986) J. Cell Biol. 103:2253; Perara et al. 
. (1986) Science 232:348). Antigen-encoding mRNAs have 
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beeh purified by taking • advantage of the 
immunoreactivity of nascent polypeptides associated 
with ribosome complexes (Sambrook, J., Fritsch, E.F.., 
Maniatis,. T. Molecular cloning; A laboratory Manual 
(Cold Spring Harbor, NY) (1989) 2&4> sections 8.9- 
a ibl. Such immunoreactive ribosome complexes can be 
immunop'recipitated from solution, or separated by 
protein A column chromatography from non-reactive 
ribosome complexes (Schutz et al. (1977) Nuc. Acids 
Res. ,4, 71; Shapiro and Young (1981) J. Biol. Chem. 
256 , 1495). Cyclical selection and amplification of 
RNAs with partitionabie properties is now also 
possible. Historically, mRNA selection is closely tied 
to immunopurification of ribosome complexes, however, 
the partitioning of ribosome complexes according to the 
present invention is not restricted to ' immunoreactivity 
of the nascent polypeptides. 
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fiTTMMARY OF THE INVENTION 

In its broadest aspect, the method of 
systematic polypeptide evolution by reverse translation 
(SPERT) includes a candidate mixture of polypeptides 
having a randomized amino acid sequence. Each member 
of the mixture is linked to an individualized mRNA 
which encodes the amino acid sequence of that 
polypeptide. The candidate polypeptides are 
partitioned according to their property of binding to a 
given desired target molecule. The partitioning is 
carried out in such a way, herein described, that each 
mRNA encoding a polypeptide is partitioned exactly 
together with that polypeptide. In this way each 
polypeptide is partitioned together with the means for 
further amplifying it by an in vitro process. 
Ultimately, both the desired optimal polypeptide ligand 
of the desired target and the mRNA encoding the 
polypeptide are simultaneously selected, allowing 
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. - ^further synthesis, of ;.1:he selected, polypeptide as 
- desired , and further amplification of the, coding 
sequence. It is therefore not necessary to analyze the 
amino acid sequence of the selected polypeptide (using 
5 protein ^chemistry), in-order. to produce it in desired 

Viewed another way, the invention is the 

selective evolution of a nucleic acid that encodes a 
polypeptide ligand of a desired target. The present 

10 method is therefore a selection based upon coding 
properties available in a candidate nucleic acid 
mixture. In previously filed applications, U.S. Serial 
No. 07/536,428, filed June 11, 1990 and U.S. Serial No. 
07/714,131 filed July 10, 1991, both of which are 

15 incorporated herein by reference, the inventors herein 
have taught a method for selective evolution of nucleic 
acids based upon binding 'properties of the nucleic 
acids 'themselves. . The insight that cyclical selection 
and amplification can be a powerful tool for developing 

20 novel compounds when coupled with a partitioning system 
is herein adapted to evolving specific coding nucleic 
acids, based on the partitioning properties of 
polypeptide ligands binding to target molecules. 

More specifically, the invention includes a 

25 method for making a polypeptide ligand of a desired 
target molecule which includes the following steps: 
First, synthesizing a mixture of translatable mRNA's, 
having certain sequence segments in common such as a 
ribosome binding site and a translation initiation 

30 codon and having a segment encoding a polypeptide at 
least part of which coding region is a randomized 
sequence. Second, employing the mRNA mixture in an in 
vitro translation system. Synthesis of nascent 
polypeptides ensues, each encoded by its own mRNA. At 

3 5 any time during translation, stable ribosome complexes 
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cante^solatedr ±t L iB preferred'^ isolate ^ciMnplexes 
in which translation has been stopped, or "stalied". by 
any of several known circumstances. Each isolated 
ribosome complex includes at least one ribosome, one 
nascent peptide and the coding mRNA which is now said 
to be translated mRNA. Although its chemical* structure 
is unaltered, translated mRNA*"is' bound to the ribosome 
complex in a different manner than it was bound prior 
to translation, as is known in the art. Third, the 
ribosome complexes are partitioned with respect to the 
binding of each nascent polypeptide to a desired target 
molecule. Some polypeptides bind weakly, some tightly, 
some not at all, with the target. The partitioning, 
however conducted, generally separates the mixture of 
ribosome complexes into ribosome complex-target pairs 
and unbound complexes. The set of ribosome complex- 
target pairs is thereby enriched for those polypeptides 
(and, necessarily their coding mRNA's) that can bind to 
the. target. Fourth, the encoding mRNA's are separated 
from the complexes and amplified by conventional means 
for amplifying nucleic acids, such as reverse 
transcription and polymerase chain reaction (PCR) . 
This amplification sets the stage for a subsequent 
round of transcription, polypeptide synthesis and 
partitioning to further enrich for target-binding 
polypeptide ligands. These cycles can be reiterated as 
many times as desired, until a desired binding affinity 
is achieved, or no further improvement in binding 
affinity is observed. The coding mRNA for any 
polypeptide selected in the foregoing manner can be 
cloned and sequenced, if desired. An individual 
polypeptide ligand can then be prepared in vivo from 
cloned coding mRNA, or by chemical or enzymatic methods 
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. In an, alternate embodiment of, the present 
•invention, means for linking the nascent polypeptide to 
the translated mRNA are included in the design of the 
system. According to this method, a direct 
.'connection-^ v ©nf tight 

af f inity interactions—between' the polypeptide and the 
mRNA allows for the removal of the ribosomal linkage 
between these two elements leaving mRNA •polypeptide 
copolymers. By removing the relatively^ large ribosome 
from the mRNA polypeptide copolymer, the ability to 
partition polypeptides based on the affinity of the 
randomized polypeptides to a given target may be 
greatly increased. In addition, the ribosome is then 
freed to translate additional mRNA species. The fewer 
ribosomes that can be utilized, the more randomized 
polypeptides can be generated in the process. In a 
specific example of this embodiment, a biotin molecule 
is covalently bound to the 5' end of the mRNA sequence 
utilized, and the nucleic acid template includes a 
fixed sequence in the translated region that encodes a 
polypeptide that may be covalently bound to biotin. 

The present invention provides a class of 
products which are polypeptides, each having a unique 
sequence, each of which has the property of binding 
specifically to a desired target compound or molecule. 
Each compound of the invention is a specific ligand of 
a given target molecule. The invention is based on the 
unique insight that . cyclical selection and 
amplification of nucleic acids can be applied to coding 
sequences by partitioning such coding sequences 
according to the binding affinities of the encoded 
" polypeptides. In vitro evolutionary selection can 
therefore be applied for the first time to up to about 
10 18 different polypeptides. Polypeptides have 
sufficient capacity for forming a variety of two- and 
three-dimensional structures and sufficient chemical 
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versatility available within their monomers, to act as 
ligands (form specific binding pairs) with virtually 
any chemical compound, whether monomeric or polymeric. 
Molecules of any size can serve as targets. Most 1 
commonly, and preferably, for therapeutic applications, 
binding takes place in aqueous solution" at conditions 
of" salt, temperature and pH near acceptable 
physiological limits. For other uses different binding 
conditions can be employed. 

The invention also provides a method which is 
generally applicable to make a polypeptide ligand for 
any desired target. The method involves selection from 
a mixture of candidates and step-wise iterations of 
structural improvement, using the same general 
selection theme, to achieve virtually any desired 
criterion of binding affinity and selectivity. 

While~"riot bound by a theory of operation, SPERT 
is based on the inventors' insight that within a 
polypeptide mixture containing a large number of . 
possible sequences and structures there is a wide range 
of binding affinities for a given target. A 
polypeptide mixture comprising, for example a 10 amino 
acid randomized segment can have 20 10 candidate 
possibilities. Those which have the higher affinity 
constants for the target are most likely to bind. 
After partitioning ribosome complexes or 
mRNA • polypeptide copolymers, dissociation of mRNA and 
reverse transcription/ amplification/ transcription, a 
second polypeptide mixture is generated by translation, 
enriched for the higher binding affinity candidates. 
Additional rounds of SPERT progressively favor the best 
ligands until the resulting polypeptide mixture is 
predominantly composed of only one or a few sequences. 
These can then be individually synthesized and tested 
for binding affinity as pure ligands. One cycle of 
SPERT effectively achieves reverse translation, at 
least quantitatively. 
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1 ' i.. • . The ability% to .rapidly* select, a. single sequence 
or family of sequences from a huge number of candidates 
hasbeen dramatically shown in the nucleic acid area. 
In U.S. Patent Application Serial No. 07/714,131 , 
5 (referred to herein, along with U.S. Patent Application 

^serial No." 07/536, 428 >v as the. SELEX, Applications ),, 
nucleic acid ligands to a variety of targets-- including 
both protein targets that are known to bind nucleic 
acids and protein targets that are not known to bind 

10 nucleic acids — have been identified. In such 
application there is also a description of a 
mathematical analysis of the partitioning and cycling 
aspects of SELEX referred to as SELEXION. This 
mathematical analysis dramatically demonstrated that by 

15 cycling through the partitioning process a number of 

times at a moderate stringency it is possible to obtain 
the individual species in a randomized mixture which 
have the highest affinity to the selected target. 

In actual practice, the SELEX Applications show 

20 that although in some cases a single solution nucleic 
acid ligand may be identified, it is more often the 
case that a family of ligands is identified having 
similar affinity to the target. The family of ligands 
was shown to generally have the same three dimensional j 

25 configuration and many conserved sequences. 

Surprisingly, in some cases where the target was a 
nucleic acid binding protein, the SELEX process was 
able to identify a ligand solution that had a higher 
affinity to the protein than the sequence that the 

30 protein binds to in nature. These results emphasize 
the practicality of "short cutting" the evolutionary 
process by screening a mixture containing a very large 
number of candidates. 

Cycles of selection and amplification are 

35 repeated until a desired goal is achieved. In the most 
general case, selection/ amplification is continued 
until no significant improvement in binding strength is 
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f " • achieved' on repetition' of the' cycle, .r The iterative 
selection/amplification method. is' sensitive enough to 
allow isolation of two sequence variants in a mixture 
containing at least 65,000 sequence variants. The 
method could, in practice, be used to sample .about 10 1 
different polypeptide species. There -is no. upper 
limit; in principle, to the number 'of different 
polypeptides which could be sampled, only a practical 
limit dictated by the sizes of reaction vessels and 
other containers necessary to perform the method. The 
polypeptides of the test mixture include a randomized 
sequence portion as well as conserved sequences as 
desired for combining with other functional domains or 
to provide sufficient polypeptide length to insure that 
the randomized sequence is accessible to the target in 
the ribosome complex or mRNA* polypeptide copolymer. 
Amino acid sequence variants can be produced in a 
number of ways including chemical or enzymic' synthesis 
of randomized nucleic acid coding sequences. The 
variable sequence portion may contain fully or 
partially random sequence; it may also contain 
subportions of conserved sequence incorporated with 
randomized sequence. Sequence variation in coding 
nucleic acids can be introduced or increased by 
mutagenesis before or during the 
selection/amplification iterations . 

In the case of a polymeric target, such as a 
protein, the ligand affinity can be increased by 
applying SPERT to a mixture of candidates comprising a 
first selected polypeptide sequence combined with a 
second randomized sequence. The sequence of the first 
selected ligand associated with binding or subportions 
thereof can be introduced into the randomized portion 
of the amino acid sequence of a second test mixture. 
The SPERT procedure is repeated with this second test 
mixture to isolate a second polypeptide ligand, having 
two sequences (one being the first polypeptide ligand) 



selected' for binding to the target,, which has increased 
binding strength or increased* specificity of binding 
compared to the first polypeptide ligand isolated. The 
sequence of the second polypeptide ligand associated 
with binding to the , target can then be introduced* near 
the r variable portion of ; the, amino acid sequence after 
which cycles of SPERT results in a third polypeptide 
ligand. The third polypeptide ligand also contains the 
first and second ligand previously selected. These 
procedures can be repeated until a polypeptide ligand 
of a desired binding strength or a desired specificity 
of binding to the target molecule is achieved. The 
process of iterative selection and combination of 
polypeptide sequence elements that bind to a selected 
target molecule is herein designated "walking," a term 
which implies the optimized binding to other accessible 
areas of a macromolecular target surface or cleft, 
starting from a first binding domain. Increasing the 
area of binding contact between ligand and target can 
increase the affinity constant of the binding reaction. 
These walking procedures are particularly useful for 
isolating novel polypeptides which are highly specific 
for binding to a particular target molecule. 

A variant of t^e walking procedure employs a 
ligand termed "anchor" which is known to bind to the 
target molecule at a first binding domain (See Figure 
8). This anchor molecule can in principle be any 
molecule that binds to the target molecule and which 
can be covalently linked directly or indirectly to a 
small bridge molecule for which a peptide binding 
sequence is known. When the target molecule is an 
enzyme, for example, the anchor molecule can be an 
inhibitor or substrate of that enzyme. The anchor can 
also be an antibody or antibody fragment specific for 
the target. The anchor molecule is covalently linked 
to the bridge molecule, chosen to bind an oligopeptide 
of known sequence. A test mixture of candidate 
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polrieptides-is^then prepared: which includes a 
randomized portion" and includes also the : known; sequence 
that binds the bridging molecule. The bridging 
molecule binds the polypeptides to the target molecule 
in the vicinity of the anchor binding site. , SPERT is 
then -applied 3 to- select polypeptides^ whicha bind a 
surface of the target molecule* adjacent to the anchor 
binding site. Polypeptide ligands which bind to the 
target are isolated. Walking procedures as described 
above can then be applied to obtain polypeptide ligands 
with increased binding strength or increased 
specificity of binding to the target. Walking 
procedures could employ selections for binding to the 
anchor binding site itself or to another part of the 
target itself. This method is particularly useful to 
isolate polypeptide ligands which bind at a particular 
site within the target molecule. The anchor acts to 
ensure the isolation of polypeptide sequences which 
bind to the target molecule at or near the binding site 

of the anchor. 

Screens, selections or assays to assess the 
effect of binding of a polypeptide ligand on the 
function of the target molecule can be readily combined 
with the SPERT methods. (Specifically, screens for 
inhibition or activation of enzyme activity can be 
combined with the SPERT methods. 

in more specific embodiments, the SPERT method 
provides a rapid means for isolating and identifying 
polypeptide ligands which bind to nucleic acids and 
proteins, including enzymes, receptors, antibodies, and 
glycoproteins . 

In another aspect, the present invention 
provides a method for detecting the presence or absence 
of, and/or measuring the amount of a target molecule in 
a sample, which method employs a polypeptide ligand 
which can be isolated by the methods described herein. 
Detection of the target molecule is mediated by its 



binding to a ; polypeptide ligand .specif ic for that 
target molecule. The polypeptide, ligand .can be . 
labeled, for example radiolabeled or enzyme linked, to 
allow qualitative or quantitative detection, analogous 
to ELISA and RIA methods. The detection .method is 
particularly useful fon target .molecules ,which,.are 
proteins. The method is more particularly useful for 
detecting proteins which are known to be only weakly 
antigenic, or for which conventional monoclonal 
antibodies of a desired affinity are difficult to 
produce. Thus, polypeptide ligands of the present 
invention can be employed in diagnostics in a manner 
similar to conventional antibody-based diagnostics, 
one advantage of polypeptide ligands over conventional 
antibodies in such detection methods and diagnostics is 
that polypeptides are capable of being readily 
synthesized in vitro or after cloning, since the method 
of the invention concomitantly selects the means for 
amplification, e.g., coding nucleic acids, along, with 
the ligand itself. Alternatively, the polypeptide can 
be chemically synthesized since its amino acid sequence 
can be ascertained readily from the nucleotide sequence 
of its coding mRNA. A SPERT-generated polypeptide 
ligand need not be as large as an antibody molecule. 
Another advantage is that the entire SPERT process is 
carried out in vitro and does not require immunizing 
test animals. Furthermore, the binding affinity of 
polypeptide ligands can be tailored to the user's 
needs. Compared to antibodies, SPERT-generated ligands 
have much greater versatility. Conventional antibodies 
are immunoglobulins, which, although capable of a large 
repertoire of binding affinities, are nevertheless 
variations of a narrow amino acid sequence and 
structural theme. SPERT-generated polypeptide ligands, 
in contrast, are unlimited as to structural type, and 
therefore have virtually unlimited potential for 
binding. 







' * • Polypeptide ligands of small .molecule targets 
are useful as diagnostic assay reagents and have 
therapeutic uses as sequestering agents, drug delivery 
vehicles and modifiers of hormone action. Catalytic 
' ' 1 5 polypept ides are selectable products of this invention. 

§0$$ v^ For - e ^ple;-b7-selecting,- ; for^binding.ttq^transition 
W?*V state" analogs of an enzyme catalyzed reaction, 

: catalytic polypeptides can be selected. Catalytic 

immunoglobulins have been developed by raising 
10 antibodies to transition state analogs (Schultz, P.C. 
(1989) Angew. Chem. Int. 2d Engl. 2* 1283-1295; 
Schultz, P-G. (1989) Acc. Chem. Res. £2:287-294; 
'"' Pollack, S.J. et al. (1989) Math. Enzymol. 1Z&:551- 

::.^::J X . 568). 

15 in yet another aspect, the present invention 

' ' provides a method for modifying the function of a 

target molecule using polypeptide ligands which can be 
isolated by SPERT . Polypeptide ligands which bind to a 
target molecule are screened to select those which 
specifically modify function of the target molecule, 
for example to select inhibitors or activators of the 
function of the target molecule. An amount of the 
selected polypeptide ligand which is effective for 
modifying the function of the target is combined with 
the target molecule to achieve the desired functional 
modification. This method is particularly applicable 
to target molecules which are proteins. A particularly 
useful application of this method is to inhibit protein 
function, for example to inhibit receptor binding or to 
inhibit enzyme catalysis. In this case, an amount of 
the selected polypeptide molecule which is effective 
for target protein inhibition is combined with the 
target protein to achieve the desired inhibition. 

The term "reverse translation" is used 
throughout as shorthand for the concept of information 
flow from polypeptide sequence to nucleic acid 
sequence. The phrase and shorthand make reference to 
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fV ~. the original and revised, "central. dogma?' pronounced by 
Francis Crick many years ago.. Crick understood and 
articulated the idea that either RNA or DNA could serve 
as a template for the synthesis of complementary , 
5 nucleic acid sequences, and that chemically either RNA 
or DNA could serve as a. template for the synthesis of 
both RNA and DNA. Crick noted that proteins, comprised 
of strings of amino acids, were templated by nucleic 
acid but could not serve themselves as a template for 

10 the synthesis of nucleic acids. 

Most importantly, no simple chemistry is known 
that allows "reverse translation"; that was the basis 
nearly 25 years ago of Crick's adaptor hypothesis for 
using information in RNA to yield specified protein 

15 sequences during translation. 

SPERT has at its center a form' of reverse 
translation that does not conflict with Crick's 
postulates. While no process, no simple chemistry, is 
known that provides synthesis of a nucleic acid 

20 containing a sequence specified by a polypeptide (whose 
sequence is unknown to the scientist at the time of 
reverse translation) , SPERT provides a reliable 
mechanism for amplifying and using mRNAs that encode 
polypeptides of desired function but of unknown 

25 sequence. Techniques for binding one or a few 

polypeptides to a selected target are known in the art, 
although binding of a small number of polypeptides from 
a randomized pool of polypeptides is of no value by 
itself. It is the concomitant selection in the 

30 ribosome complex or mRNA • polypeptide copolymer of the 

mRNAs that encode those very polypeptides that provides 
a form of reverse translation because: 

1) the selected coding sequences can be 
amplified to yield large quantities of both DNA and 

35 RNA; 

2) the newly made mRNA can be used for 

synthesizing polypeptides, now a smaller set than the 
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original' rahdokized ^xture^of . polypeptides from which 
non-binding, or poorly-binding polypeptides have been 

removed, and; 

3) the polypeptides held in ribosome 
complexes or mRNA* polypeptide copolymers can be used 
for" 'a subsequent round" "of SPERTV- — - - 

Finally/ "reverse translation" during SPERT 
does not yield a nucleic acid from only polypeptide 
sequence, but "reverse translation" does provide 
(through amplification techniques) net synthesis of the 
templates from which the desired polypeptide was 
synthesized. In principle a single molecule of 
polypeptide of the desired activity, along with a 
single template RNA in the translation complex or 
copolymer, will lead to a nanomole or even a micromole 
of nucleic acid corresponding to that polypeptide 
sequence. This net synthesis of nucleic acids based on 
the partitioning and activity of the desired 
polypeptide is an effective quantitative reverse 
translation that provides the materials for subsequent 

rounds of SPERT. 

Also, the coding sequence can be used to deduce 
the amino acid sequence of a selected polypeptide. The 
polypeptide can then be synthesized by chemical 
methods, if desired. 

BPTKF DFsraTPTTQM r>V THE FIGURES 

Figure 1 is a diagrammatic representation of 
steps in the process of the invention. The top panel 
depicts a double-stranded DNA template having a T7 
promoter ("T7 PRO" ) and a segment of randomized 
sequence, represented as "nnn......", preceded by a 

start codon, ATG. The initiation site of transcription 
and direction of transcription are shown as a vertical 
line labeled and an arrow, respectively. In vitro 

transcription creates mRNAs (2nd panel) which contain, 
from left to right, a ribosome binding site, a 
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randomized • sequence region, a„ 3 ' fixed sequence region, 
and a 3' primer annealing site. In vitro translation 
of this mixture gives rise to ribosome complexes with 
randomized nascent polypeptides (3rd panel) . The, 

5 ribosome complexes . are subjected to selection for . 
affinity, of : the. nascent t polypeptide^ and a^ desired 
i target, molecule (bottom panel). The. encoding mRNAs of 
the partitioned complexes are purified and subjected to 
amplification, e.g., by reverse transcription, PCR and 

LO transcription, to generate mRNAs for a second cycle of 
the process. 

Figure 2 is a diagram showing expanded views of 
a ribosome complex. The top panel is a ribosome 
complex as in the third panel of Figure 1. A cut-away 

15 view of the ribosome (2nd panel) shows 30-40 amino 

acids of the nascent polypeptide buried in the complex 
and unavailable for interaction with the solvent. The 
ribosome is depicted with two shades of gray to 
indicate inner and outer regions. The nascent 

20 polypeptide is depicted as a thick white line extending 

vertically from a central tunnel (black) near the 
center of the ribosome. That portion inside the 
ribosome is depicted as 30-40 amino acids in length. 
The carboxy-termina]jj end of the nascent polypeptide is 

25 shown connected to a peptidyl-tRNA (curly black line) . 

The region bordered by a dotted line is expanded in the 
bottom panel showing that the nascent polypeptide is 
covalently linked to a transfer RNA molecule which is 
hydrogen-bonded to the mRNA at a codon in the P-site. 

30 Figure 3 is a diagram that represents 

partitioning polypeptide ligands by direct 
immunoprecipitation. The top panel is a ribosome 
complex as in Figure 1. The center panel depicts 
several ribosome complexes where the nascent 

35 polypeptide is represented as a short, thick white line 
with hatching to indicate the segment of randomized 
sequence. Molecules of a first antibody 
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(i^unogiobuiin) are represented as inverted Y-shaped 
structures drawn with heavy, straight black lines, 
interaction (binding) of a nascent polypeptide with the 
epitope recognition site of an immunoglobulin is shown 
for two ribosome complexes. Nascent polypeptides are 
" -selected that^kve affinity' for; immunoglobulin - 
' "molecules 1" The bottom panel shows addition of a second 
antibody (white inverted Y's) generally reactive to the 
first immunoglobulin resulting in an immunoprecipitate 
containing the selected ribosome complexes, shown as a 
cluster in the left half of the panel . 

Figure 4 is a diagram showing partitioning of 
polypeptide ligands by indirect immunoprecipitation. 
The top panel shows a target protein which has an 
immunoreactive domain ("handle") and a target domain 
("pan"). Three types of ribosome complexes are 
depicted in the second panel. Those with no affinity 
for the target protein are shown in white. Those with 
affinity for the "pan" are shown in light gray labeled 
with a "P" and shown with a bound target protein 
attached by the "pan" to the nascent peptide. Those 
with affinity for the "handle" are dark gray, labeled 
with an »H" and shown with a bound target protein 
attached by the "handle'^ to the nascent peptide. In 
the third panel, a first antibody (black lines) 
directed against the "handle" either displaces ligand 
associations of the »H" complexes or those complexes 
are unreactive. The first antisera form a sandwich 
with the "P" complexes made up of a ribosome complex 
associated with the target protein, through its "pan", 
and bound to the first immunoglobulin through the 
"handle". These "P" complexes are immunoprecipitated 
by second antisera directed against the primary 
antisera, as shown in the bottom panel. 

Figure 5 is a diagram showing selection of 
polypeptide ligands by membrane partitioning. The top 
panel shows a ribosome complex as in Figure 1. The 
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middle panel* shows ribosome complexes and membrane 
vesicles, with membrane proteins . . . The' membrane vesicles, 
are depicted as a hatched band interrupted by hatched 
ovals that depict membrane proteins embedded in the 
membrane. . In the middle panel, ribosome complexes are 
shown . binding with .membrane protean ^ so .that, the nascent 
polypeptides having binding affinity for a membrane 
protein are partitioned. The bottom panel depicts 
three ribosome complexes bound to a membrane vesicle, 
forming a large complex which is separable from unbound 
ribosome complexes. 

Figure 6 is a diagram showing partitioning of 
polypeptide ligands by affinity column chromatography. 
Ribosome complexes (top panel) are passed through a 
column containing insoluble support materials to which 
have been bonded target molecules. The middle panel is 
an expanded view of the column showing support 
materials (hatched circular segments) with attached 
target molecules (black bars) to which some ribosome 
complexes are bound. The bottom panel shows, enlarged, 
3 single ribosome complex in which the nascent 
polypeptide (light shading) is bound to a target 
molecule which is attached to a column support bead 
(hatched) . Ribosome complexes with high affinity to 
the target molecules are retained on the column and 
subsequently eluted to continue with SPERT. 

Figure 7 is a diagram showing anchoring of a 
binding epitope and secondary ligand evolution. A 
molecule ("inhibitor") of known affinity for a target 
site on a protein is covalently linked to a "guide 
epitope". The guide epitope is any molecule for which 
there exists a peptide ligand, including a portion of a 
monoclonal antibody which contains an epitope 
recognition domain (Fab fragment) . The mRNA encodes a 
reactive peptide sequence that binds the guide epitope, 
incorporated into the nascent polypeptide. The bottom 
panel depicts a ribosome complex having a nascent 
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mm polypeptide that includes the reactive, guide binding, 

segment (shaded) and a randomized segment (unshaded) . 
' The ribosome complex is shown bound to the protein of 
interest by a binding interaction between the guide 
a epitope and the reactive segment and by a secondary 
* 1 binding interaction' between-the randomized- segment and 

K ^ a neighboring site on the target protein of interest.. 

The randomized portion of the nascent polypeptide is 
free to evolve interactions with secondary sites on the 

10 target protein. 

Figure 8 is a diagram which shows the DNA to be 
transcribed and the relationships of the 
oligonucleotides of Tables 1 and 2 in the DNA, prior to 
inserting the randomized sequence. The depicted 
structure constitutes a cassette for carrying out the 
transcription, translation, reverse transcription and 
PCR processes used in SPERT. 
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The following terms are used herein according 
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to the definitions. 

Polypeptide is used herein to denote any string 
of amino acid monomers capable of being synthesized by 
an in vitro translation system. The term also embraces 
post-translational modifications introduced by chemical 
or enzyme-catalyzed reactions, as are known in the art. 
Such post-translational modifications can be introduced 
prior to partitioning, if desired. Unless specified 
herein, all amino acids will be in the L-stereoisomeric 
30 form. Amino acid analogs can be employed instead of 

the 20 naturally-occurring amino acids. Any amino acid 
analog that is recognized by an aminoacyl-tRNA 
synthetase can be employed. Several such analogs are 
' known, including fluorophenyl alanine, norleucine, 

azetidine-2-carboxylic acid, S-aminoethyl cysteine, 4- 
methyl tryptophan and the like. 

Ligand means a polypeptide that binds another 
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molecule .(target) • In a population of candidate 
polypeptides, a ligand is one which binds with greater 
affinity than that of the bulk population. In a 
candidate mixture there can exist more than one Ugand 
5 for a given target. The ligands can differ from one ( 
another in their, binding affinities for., the garget 

I molecule. 

Candidate mixture is a mixture of nucleic acids 
and of polypeptides of differing sequence, from which 

10 to select a desired coding sequence and/or a desired 

ligand. The candidate mixture of nucleic acids serving 
as source of a candidate mixture of polypeptides can be 
ill vitro transcription products of natural ly-occurring 
nucleic acids or fragments thereof, chemically 

15 synthesized nucleic acids, enzymatically synthesized 

nucleic acids or nucleic acids made by a combination of 
the foregoing techniques. Target molecule means any 
compound of interest for which a ligand is desired. A 
target molecule can be a protein, fusion protein, 

20 peptide, enzyme, nucleic acid, nucleic acid binding 
protein, carbohydrate, polysaccharide, glycoprotein, 
hormone, receptor, receptor ligand, cell membrane 
component, antigen, antibody, virus, virus component, 
substrate, metabolite, transition state analog, 

25 cofactor, inhibitor, drug, controlled substance, dye, 
nutrient, growth factor, toxin, lipid, glycol ipid, 
etc., without limitation. 

Partitioning means any process whereby ribosome 
complexes or mRNA- polypeptide copolymers bound to 

30 target molecules, termed complex-target pairs herein, 
can be separated from ribosome complexes or 
mRNA • polypeptide copolymers not bound to target 
molecules. Partitioning can be accomplished by various 
methods known in the art. The only requirement is a 

35 means to separate complex-target pairs from unbound 
ribosome complexes or mRNA* polypeptide copolymers. 
Columns which selectively bind complex-target pairs but 
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not ribbsome complexes or mRNA • polypeptide copolymers, 
(or' specifically retain ligand. to an immobilized 
target) can be used for partitioning. A membrane or 
membrane fragment having the target on its surface can 
bind ligand-bearing ribosome complexes or 
" mRNA-polypeptide copolymers forming ; the basis .of a 
partitioning based on particle size. The choice of 
partitioning method will depend on properties of the 
target and of the complex-target pairs and can be made 
according to principles and properties known to those 
of ordinary skill in the art. 

Amplifying means any process or combination of 
process steps that increases the amount or number of 
copies of a molecule or class of molecules. Amplifying 
coding mRNA molecules in the disclosed examples is 
carried out by a sequence of three reactions: making 
cDNA copies of selected mRNAs, using polymerase chain 
reaction to increase the copy number of each cDNA, .and 
transcribing the cDNA copies to obtain an abundance of 
mRNA molecules having the same sequences as the 
selected mRNAs. Any reaction or combination of 
reactions known in the art can be used as appropriate, 
including direct DNA replication, direct mRNA 
amplification and the like, as will be recognized by 
those skilled in the art. The amplification method 
should result in the proportions of the amplified 
mixture being essentially representative of the 
proportions of different sequences in the mixture prior 

to amplification. 

Specific binding is a term which is defined on 
a case-by-case basis. In the context of a given 
interaction between a given ligand and a given target, 
a binding interaction of ligand and target of higher 
affinity than that measured between the target and the 
candidate ligand mixture is observed. In order to 
compare binding affinities, the conditions of both 
binding reactions must be the same, and should be 
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comparable to the conditions of the \ intended use. For 
the most accurate comparisons,, measurements will be 
made that reflect the interaction between ligand as a 
whole and target as a whole. The polypeptide ligands 
5 of the invention can be selected to be as.specifio as 
required, either by establishing selection .conditions 
that demand the requisite specif icity .during SPERT, or 
by tailoring and modifying the ligands through 
"walking" and other modifications using iterations of 
10 SPERT. 

Randomized is a term used to describe a segment 
of a nucleic acid or polypeptide having, in principle 
any possible sequence over a given length. Randomized 
nucleic acid sequences will be of various lengths, as 
15 desired, ranging from about twelve to more than 300 
nucleotides. The chemical or enzymatic reactions by 
which random sequence segments are made may not yield 
mathematically random sequences" due to unknown biases 
or nucleotide preferences that may exist. Redundancy 
20 of the genetic code, and biases in the tRNA content of 
an in vitro translation system can introduce additional 
bias in the translated amino acid sequences. 
Introducing a deliberate bias into a randomized coding 
region can reduce the| bias of the resulting translated 
25 amino acid sequence. The term "randomized" is used 

instead of "random" to reflect the possibility of such 
deviations from non- ideality. In the techniques 
presently known, for example sequential chemical 
synthesis, large deviations are not known to occur. 

A bias may be deliberately introduced into a 
randomized sequence, for example, by altering the molar 
ratios of precursor nucleoside (or deoxynucleoside) 
triphosphates of the synthesis reaction. A deliberate 
" bias may be desired, for example, to improve the 
35 randomness of amino acid sequence of translated 

polypeptides or to lower the frequency of appearance of 
certain amino acids. 
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. For example* a randomized sequence .biased for 
codons of the form ARN (where A is Adenine, R is 
Adenine or Guanine and N is any nucleotide) the most 
commonly encoded amino acids are basic (Arg, Asn, Lys) 
or polar (Ser) . Randomized sequences biased for codons 
of the form GRN are biased for acidic amino acids, Asp 
(GAU, GAC) 'and Glu : (GAA, GAG) , and Glycine (GGN) . 
Randomized sequences in which U is never the 1st base 
in the triplet codon will lack termination signals and 
will not encode amino acids Phe, Tyr, Cys and Trp. By 
such strategies, randomized coding sequences can be 
biased for the type of structure likely to bind a given 
target. For example, polypeptide sequences biased for 
acidic amino acids can bind cationic target molecules 
more easily than completely random polypeptides. 

Translatable mRNA is RNA which' possesses all 
requisite sequences for translation in a conventional 
ia vitro translation system. These include / proper 
orientation and sequence proximal to the 5' end of the 
RNA, a ribosome binding site and an initiation codon. 
in prokaryotes, as is known in the art, other codons, 
such as UUG and GUG can serve as initiation codons and 
encode methionine if properly spaced within a ribosome 

binding site. | 

Ribosome binding site means a nucleotide 
sequence in the mRNA which functions as a binding site 
for a ribosome in an in vitro translation system. The 
sequences which function as ribosome binding sites 
differ depending on whether the ribosomes are of 
procaryotic or eucaryotic origin, as is known in the 
art. in procaryotic systems, the ribosome binding site 
is a short purine-rich region, with a sequence such as 
GAGG or AGGA, usually located about 5-12 bases 5' to 
• the initiation codon. The translation initiation codon 
is therefore usually located within 5-12 bases from 
the ribosome binding site in the 3' direction on the 
mRNA. These sequences are sometimes termed a Shine- 
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Dalgamo ' sequence. r The structures, of ribosoine binding 
sites and their proper placement to ensure correct 
initiation of protein synthesis are well known in the 
art. 

Initiation codon is a characteristic 
trinucleotide sequence^ ,AUG. which encode^ methionine and 
which encodes a first amino acid of an' encoded' 
polypeptide and also sets the codon reading frame for 
the nucleotide sequence in the 3' direction from the 
initiation codon. 

Ribosome complex is a macromolecular complex 
including at least one ribosome, attached mRNA molecule 
and, for each ribosome, a nascent polypeptide attached 
via tRNA to the ribosome. The nascent polypeptide has 
an amino acid sequence encoded by the attached mRNA. 
Ribosome complexes are formed, as is known in the art, 
during protein synthesis. Ribosome complexes are 
stable if they become stalled for any reason, for 
example, by depletion of release factor, lack of 
termination codon in the message, lack of a charged 
tRNA, etc., as known in the art. The mRNA together 
with attached ribosome (s) and nascent peptide (s) remain 
stably bound and can be isolated together, using 
methods known in the art. 

mRNA 'polypeptide copolymer is a macromolecular 
complex including an mRNA and a polypeptide having an 
amino acid sequence encoded by the attached mRNA. 
According to one embodiment of the invention, 
mRNA • polypeptide copolymers are formed by the creation 
of a candidate mixture in which the RNA includes fixed 
sequences and/or chemical modifications in both 
non-translated and translated regions so that a portion 
of the translated polypeptide will link with a portion 
of the mRNA via a covalent bond or tight affinity 
interaction. In other embodiments, the translated 
polypeptides or tRNA species utilized may be modified 
as well to facilitate the formation of mRNA • polypeptide 
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" or inl^' tRNA« polypeptide co^ 

in vitro translation can be carried out using 
known systems. These well-known translation systems 
are the fc. fifiU system, the wheat germ system, and the 
5 rabbit reticulocyte system. The latter is available 
' commercially The^' conditions for carrying out in vitro 
translations "are well-known in the art, and- various . 
modifications, adaptations and optimizations are 
available to those skilled in the art. 

The combination of translatable mRNA encoding a 
polypeptide and in vitro translation system constitute 
amplifying means for amplifying the quantity of 
polypeptide encoded by the mRNA. The mRNA can itself 
be amplified using reverse transcription, PCR with 
15 appropriate primers and an RNA polymerase. The 

amplified mRNA can serve for in vitro synthesis of 
desired quantities of the encoded polypeptide. As 
noted,' saara, this' process constitutes reverse 
translation. 

The terms "ribosome" and "nascent peptide" have 
conventional meanings known in the art. The term 
-translated mRNA" simply refers to mRNA present in a 
ribosome complex, either wholly or partially 

translated. 

Ribosome complex-target pairs are ribosome 
complexes of which the nascent polypeptide component is 
bound to a target molecule. The target molecule can be 
free in solution or bound to a solid support matrix. 

Homology is used to compare the related uses of 
sequences. Percent amino acid sequence homology is 
measured by comparing sequences of equal length 
position by position. The percent of those positions 
occupied by the same amino acid in two polypeptides is 
"the percent sequence homology. Thus, given peptide 
ABCDE as a naturally-occurring comparison peptide, 
peptides ABCDX or ABXDE are 80% homologous but peptides 
ABXYZ, AXYZE and XYZDE are 4 0% homologous and peptides 
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1 EDCBA,' BDAEC , MNOPQ are non-homologous. 

The St>ERT method involves the combination pf a ■ 
selection of polypeptide ligands which bind to a target 
molecule, for example a protein, with amplification of 
those selected polypeptides via the attached mRNAs. 
Iterative cycling of the selection/ .amplification steps 
allows selection of one or a small number of 
polypeptides which bind most strongly to the target 
from a pool which contains a very large number of 
nucleic acids and hence encoded polypeptides. 

Cycling of the selection/ amplification 
procedure is continued until a selected goal is 
achieved. For example, cycling can be continued until 
a desired level of binding of the polypeptides in the 
test mixture is achieved or until a minimum number of 
polypeptide components of the mixture* is obtained (in 
the ultimate case until a single species remains in the 
test mixture). In many cases, -it will be desired to 
continue cycling until no further improvement of 
binding is achieved. It may be the case that certain 
test mixtures of polypeptides show limited improvement 
in binding over background levels during cycling of the 
selection/ amplification. In such cases, the sequence 
and length variation in the test mixture should be 
increased until improvements in binding are achieved. 
Anchoring protocols and/or walking techniques can be 
employed as well. 

Specifically, the method requires the initial 
preparation of a test mixture* of candidate 
polypeptides. A translatable mRNA mixture is prepared, 
each member of the mixture including in its nucleotide 
sequence a ribosome binding site, an initiation codon 
and a randomized coding region. Preferably the 

• individual mRNA 1 s contain a randomized region flanked 
by sequences conserved in all nucleic acids in the 
mixture. The conserved regions are provided to 
facilitate amplification of selected nucleic acids. 
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Since there are many such sequences known in the art, 
the'choice of sequence is one which those of ordinary 
"'^V? skill in the art can make, having in mind the desired 

method of amplification. The randomized coding region 
v'> ' ' 5 can have a fuily or partially randomized sequence 

0;&f ■ ■ according to the desired translation product.- 

Depending oii the desired polypeptide, structure, the . 
'A ' coding portion of the nucleic acid can contain 

■•' ' " subportions that are randomized, along with subportions 

which are held constant in all nucleic acid species in 
the mixture. For example, sequence regions known to 
code for amino acid sequences that bind, or have been 
selected for binding, to the target can be integrated 
with randomized coding regions to achieve improved 
binding or improved specificity of binding. Secpence 
variability in the polypeptide test mixture can also be 
introduced or augmented by generating mutations in the 
coding mRNA's during the selection/ amplification • 
process. In principle, the mRNA's employed in the test 
mixture can be any length as long as they can be 
amplified. The method of the present invention is most 
practically employed for selection from a large number 
of sequence variants. Thus, it is contemplated that 
the present method will preferably be employed to 
assess binding of polypeptide sequences ranging in 
length from about four amino acids to any attainable 



15 



20 



25 



size 



30 



35 



The randomized portion of the coding nucleic 
acids in the test mixture can be derived in a number 
of ways. For example, full or partial sequence 
randomization can be readily achieved by direct 
chemical synthesis of the nucleic acid (or portions 
thereof) or by synthesis of a template from which the 
nucleic acid (or portions thereof) can.be prepared by 
use of appropriate enzymes. Chemical synthesis 
provides the advantages of being precisely, controllable 
as to length and allowing individual randomization at 
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each' triplet position.* A commercial DNA- synthesizer 
can be used, either with an equivalent mixture of ,the 
four activated nucleotide substrates or with a biased 

mixture. Alternatively, the synthesizer can be set up 
to provide a limited nucleotide selection at a given 
position,' e.g. , • only A at the first,, triplet ^position. 
End addition, catalyzed by terminal transferase in the 
presence of nonlimiting concentrations of all four 
nucleotide triphosphates can add a randomized sequence 
to a segment. Sequence variability in the coding 
nucleic acids can also be achieved by employing size- 
selected fragments of partially digested (or otherwise 
cleaved) preparations of large, natural nucleic acids, 
such as genomic DNA preparations or cellular RNA 
preparations. In those cases in which randomized 
sequence is employed, it is not necessary (or possible 
from long randomized segments) that the test mixture 
contains all possible variant sequences. It will 
generally be preferred that the test mixture contain as 
large a number of possible sequence variants as is 
practical for selection, to insure that a maximum 
number of potential amino acid sequences of the 
translated polypeptide are identified. A randomized 
sequence of 60 nucleosides will contain a calculated 
10 36 different candidate nucleic acid sequences which 
would encode 10 26 possible decapeptides . As a 
practical matter, it is possible to sample only about 
10 18 polypeptide candidates in a single selection. 
Therefore, candidate mRNA mixtures that have randomized 
segments longer than 60 contain too many possible 
sequences for all to be sampled in one selection. Many 
epitotes recognized by antibodies are only 5-10 amino 
acids in length. It is not necessary to sample all 

" possible sequences of a candidate mixture to select a 
polypeptide ligand of the invention. It is basic to 
the method that the coding nucleic acids of the test 
mixture are capable of being amplified. Thus, it is 
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'^Ife - < preferred that any conserved- regions employed ,xn the 

test nucleic acids do not contain sequences which 



10 



interfere with amplification. 

• •' ,• The practical considerations that limit the 

• ' ' ' • 5 number of candidates that may be sampled include the 

. volume or mass of materials that; pan be handled in a 
laboratory environment. A system that- operates to form 
ribosome complexes requires a stoichiometric amount of 
ribosome in the translation mixture. The presence of 
this quantity of ribosomes severely limits the amount 
of sequences that can be sampled — to about 10 12 to 
10 u complexes. The production and isolation of 
quanitites of ribosomes in excess of these amounts 
would be impractical. As p. coU has only about 10* 
ribosomes per cell, a huge amount of E- coU would be 
required to produce stoichiometric amounts of 
ribosomes. The limitation of 10 12 to 10 M complexes is 
higher than the limitations found in other systems. that 
have been devised for sampling large numbers of 
randomized polypeptides. However, when the ribosome is 
not bound up in the ribosome complex but is free to 
translate a large number of mRNA species in the 
reaction mixture, the number of mRNA species that can 
be practically tested at ( a time rises to at least about 
10 17 to 10 18 different candidate sequences, depending on 
the number of mRNAs translated by a single ribosome. 

The complex of a ribosome, mRNA, and nascent 
polypeptide attached to a tRNA in the P-site of the 
ribosome is very stable. Release of the nascent 
peptide from the complex and of the mRNA from the 
ribosome requires protein release factors. Release 
factor recognition requires the positioning of the stop 
codons of the mRNA in the A-site of the ribosome. In 
the absence of a stop codon or release factor the 
dissociation of the translation complex from mRNA is 
very slow. The addition of the antibiotics 
cycloheximide (eukaryotic systems) and chloramphenicol 
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(prokary 6 tic system) further stabilizes the- complexes 
so that extensive manipulations, t1 ike, column 
chromatography and gradient centrifugation can be 
performed. 

5 In this embodiment a ribosome is preferably 

o. .. paused at? ; the .end of a , coding sequence, on a mRNA with 
the encoded nascent polypeptide available for 
partitioning of the complex- There are a number of 
ways in which this can be accomplished. Because stop 
10 codons are essential for release factor action, a 

translating ribosome that does not encounter any stop 
codons will proceed to the end of a mRNA and stall at 
the 3' end (Connolly and Gilmore, supra ) . In Vltrp 
translation systems which have been depleted of release 
15 factor (by iramunoinactivation or mutation) will result 

in the stalling of translation complexes at stop 
codons. Removal of GTP, the use of non-hydrolyzable 
analogues, and the use of certain antibiotics will also 
stall translational complexes. The timed addition of 
2 0 these exogenous factors to a synchronous ij\ vitro 

translation reaction can produce predictable sizes of 
nascent polypeptide for the successful partitioning of 
the translational complex. In some organisms there 
exist temperature-sensitive tRNA synthetase mutants. 
25 Another way of stalling translational complexes at 

defined sites is to include at the 3' end of the coding 
region a stretch of sense codons which are recognized 
by a single species of tRNA for which there exists a 
conditional tRNA synthetase mutant. In vitro 
30 translation reactions done from extracts of such 

mutants under the restrictive condition will result in 
stalled complexes at the stretch of sense codons for 
that particular tRNA. 

It will be understood that it is not necessary 
35 to stall or pause the translation process to obtain 

partitionable ribosome complexes. Stable complexes can 
be isolated at any time during active translation. It 
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•> is 'advantageous to isolate actively; translating 
ribosome complexes "when it Is • desired to vary the 
length of the randomized segment, e.g., to test the 
effects of polypeptide length on binding efficacy. 
5 Ribosome complexes isolated during active translation 
constitute a^population of nascent peptides., of varied 
length. By' synchronously initiating. translation and . 
isolating ribosome complexes at various times 
thereafter, the effects of increasing polypeptide 
10 length can be compared. 

Polymerase chain reaction (PCR) is an exemplary 
method for amplifying nucleic acids. Descriptions of 
PCR methods are found, for example in Saiki et al. 
(1985) Science 23J2: 1350-1354 ; Saiki et al. (1986) 
Nature 321:163-166; Scharf et al. (1986) Science 
233:1076-1078; Innis et al. (1988) Pr6c. Natl. Acad. 
Sci. 85:9436-9440; and in U.S. Patent 4,683,195 (Mullis 
et al.') and U.S. Patent 4,683,202 (Mullis et al.). - In 
its basic form, PCR amplification involves repeated 
cycles of replication of a desired single-stranded DNA 
(or cDNA copy of an RNA) employing specific 
oligonucleotide primers complementary to the 3' ends of 
both strands, primer extension with a DNA polymerase, 
and DNA denaturation . Products generated by extension 
from one primer serve as templates for extension from 
the other primer. A related amplification method 
described in PCT published application WO 89/01050 
(Burg et al.) requires the presence or introduction of 
a promoter sequence upstream of the sequence to be 
amplified, to give a double-stranded- intermediate. 
Multiple RNA copies of the double- stranded promoter- 
containing intermediate are then produced using RNA 
polymerase. The resultant RNA copies are treated with 
reverse transcriptase to produce additional double- 
stranded promoter containing intermediates which can 
them be subject to another round of amplification with 
RNA polymerase. Alternative methods of amplification 
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includS ainong 'others, cloning of selected DNAs, ,pr cDNA 
copies of selected RNAs into an appropriate vector, and 
introduction of that vector into a host organism where 
the vector and the cloned DNAs are replicated and. thus 
5 amplified (Guatelli, J.C. et al. (1990) Proc. Natl. 

Acad.' sci. * 87:1874). -Mnvgeneral, any means that will 
allow faithful, efficient amplification of selected 
nucleic acid sequences can be employed in the method of 
the present invention. It is only necessary that the 
10 proportionate representations of sequences after 
amplification reflect the relative proportions of 
sequences in the mixture before amplification. 

Specific embodiments of the present invention 
for amplifying RNAs are based on Innis et al. (1988) 
15 supra. The RNA molecules in the test mixture are 

designed to contain a sequence transcribed from a T7 
promoter in their 5- portions. Full-length cDNA copies 
of selected mRNA molecules are made using reverse 
transcriptase primed with an oligomer complementary to 
20 the 3' sequences of the selected RNAs. The resultant 

cDNAs are amplified by lajg DNA polymerase chain 

• extension, employing a primer containing the T7 
promoter sequence as well as a sequence complementary 
to the conserved 5' and of the selected RNAs. Double- . 

25 stranded products of this amplification process are 

then transcribed in vitro. Transcripts are used in the 
next selection/ amplification cycle. The method can 
optionally include appropriate nucleic acid 
purification steps. 
30 m general, any protocol which will allow 

selection of polypeptides based on their ability to 
bind specifically to another molecule, i.e., a protein 
or any target molecule, can be employed in the method 

• of the present invention. It is only necessary that 
35 the ribosome complexes or mRNA- polypeptide copolymers 

be partitioned without disruption such that the 
selected coding mRNA ' s are capable of being amplified. 
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-For'example, in a column: binding .selection , in, which a 
test mixture of ribosome complexes bearing nascent 
randomized polypeptide is passed over a column of 
immobilized target molecules, the complexes bearing 
5 polypeptide iigands of the target are retained and the 
non-target binding complexes are eluted. from the column 
with appropriate .buffer. : A wiide variety of affinity . 
chromatography techniques, including support matrices 
and coupling reactions is available for application of 
10 a column partitioning system. Target binding 

polypeptides together with mRNA's encoding each remain 
bound to the column. The relative concentrations of 
protein to test polypeptides in the incubated mixture 
influences the strength of binding that is selected 
for. When polypeptide is in excess, competition for 
available binding sites occurs and those polypeptides 
which bind most strongly are selected. Conversely, 
.when an excess of target is employed, it is expected 
that any polypeptide that binds to the target will be 
selected. The relative concentrations of target to 
polypeptide employed to achieve the desired selection 
will depend on the type of target, the strength of the 
binding interaction and the level of any background 
binding that is present. The relative concentrations 
needed to achieve the desired partitioning result can 
be readily determined empirically without undue 
experimentation. Similarly, it may be necessary to 
optimize the column elution procedure to minimize 
background binding. Again such optimization of the 
elution procedures is within the skill of the ordinary 
artisan. 

An unexpected feature of the invention is the 
fact that the polypeptide ligand need not be elutable 
from the target to be selectable. This is because it 
is the mRNA that is recovered for further amplification 
or cloning, not the polypeptide itself. It is known 
that some affinity columns can bind the most avid 



ligands so tightly as to be very.dif f icult to elute. 
However the method of the invention can be successfully 
practiced to yield avid ligands, even covalent binding 
ligands. Ribosome complexes can be disrupted by., 
denaturing agents such as urea or sodium dodecyl . 
sulfate, without effecting the, integrity pt, the mRNA. 
Various mRNA •polypeptide copolymers , may be separated 
into their component units based on the specific nature 
of linking between the RNA and the associated 
polypeptide. The mRNA' s of selected ligands are 
amplified, as described elsewhere herein, to yield a 
mixture of coding sequences enriched for those that 
encode polypeptide ligands of the desired target, 
including ligands that bind tightly, irreversibly or 
covalently. 

Immunoreactivity of nascent polypeptides on 
ribosome complexes or mRNA* polypeptide copolymers can 
be used to purify the encoding. mRNAs . In one 
embodiment, ribosome complexes are purified from cells 
in the presence of inhibitors such as chloramphenicol 
or cycloheximide which stall translational complexes on 
mRNA, Binding of antibodies which recognize the 
epitope of interest followed by binding antibodies 
which recognize those antibodies results in 
immunoprecipitati^on of the ribosome complexes 
containing the mRNAs which encode the epitope. The 
background of mRNAs which do not encode the epitope of 
interest but are trapped by the immunoprecipitated 
complex can be lowered by using purified IgGs against 
the epitope followed by purification of the 
immunoreactive ribosomes on a protein A column. (IgGs 
are one class of the soluble immunoglobulins which 
compose antisera. Protein A is derived from 
sta phylococcus aureus and has a high affinity for IgGs. 
Protein A binding does not interfere with epitope 
recognition. ) 

These procedures for immunoprecipitation to 
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partition ribosome complexes or mRNA- polypeptide 
copolymers can be used in a variety of modifications to 
partition the translational complexes in SPERT. One 
such modification is termed "panhandling" (See Figure 
5 4 ). A protein is composed of an immunoreactive domain 
' for which known antibody exists, and a separate target 
domain for which' one wishes to evolve protein ligands.. 
Ribosome complexes or mRNA • polypeptide copolymers which 
interact with the target domain (the "pan") via their 

10 nascent polypeptides will be immunoprecipitated upon 
binding antibodies which recognize the immunoreactive 
domain (the "handle"). This modification is especially 
useful for developing polypeptide ligands against a 
segment of a fusion protein in which the amino terminus 

15 is the fragment of a common protein (beta- 

• galactosidase, for example) and the carboxyl-terminal 
portion is the protein of interest. It will also be 
useful' for the development of polypeptide ligands which 
recognize immunoresistant domains of a protein which 

20 has an immunodominant domain for which polyclonal sera 
is available. Where immunoprecipitation is employed, 
it will be advantageous to discard any ribosome 
complexes or mRNA • polypeptide copolymers that react 
directly with the antibodies, prior to selection. 

25 Alternative partitioning protocols for 

separating polypeptides bound to targets, particularly 
proteins, are available to the art. For example, 
binding and partitioning can be achieved by 
immunoprecipitation of the test ribosome complex 

30 mixture or test mRNA* polypeptide copolymers mixture and 
passing the immune complexes through a protein A 
affinity column which retains the immune reactive 
polypeptide-containing complexes as the column. Those 
" mRNA 1 s that encode a polypeptide that binds to the 

35 target antibody will be retained on the column as part 
of the ribosome complex or mRNA* polypeptide copolymer 
and unbound coding mRNA ' s can be washed from the 
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Interestingly; protein loops may be a powerful 
location for randomization and SPERT-based isolation of 
novel ligands. When inspecting protein structures in 
5 detail, only secondary structures are predictable; _ 
those structures include alpha helices., and- beta sheets 
or multiple strands, -and either structure can be formed 
with parallel or anti-parallel peptides. The 
connectors between such secondary structures, called 

10 loops or hairpins, are related to RNA hairpin loops and 
RNA pseudoknots in that the locations of the ends of 
the loops are set by the secondary structures but the 
exact loop structures are idiosyncratic and dependent 
on the loop primary sequences and contacts with other 

15 elements of the protein. Loop sequences, when 

randomized and put through SPERT should provide vast 
structural libraries. Disulfide bonds between 
cysteines represent another means by which to construct 
loops; similarly, zinc fingers and copper or other 

20 metal "fists" also provide other kinds of loops. 

Effective partitioning can be carried out with 
pure or impure target preparations. In cases where 
target preparations are impure, selectivity can be 
enhanced by strategies that enhance the binding of 

25 ligands to the desired target, or which specifically 
elute desired ligands or prevent their binding. The 
latter approach is subtractive. A known ligand can 
block binding of any polypeptide that can bind the 
target so that the desired polypeptide is partitioned 

3 0 by elution and unwanted polypeptides are retained on 
the column. 

Optionally, chemical or enzymic modifications 
of the polypeptide can be introduced post- 
- translationally . The process for making such 
35 modifications should not disrupt the ribosome complexes 
or mRNA* polypeptide copolymers. An important type of 
post-translational modification is oxidation to form 



disulfides in sequences that contain two or more 
cysteines. Particularly for small polypeptides, 
disulfide bonds are especially advantageous to lock in 
a desired conformational state so that a rigid 
structure having high specificity and binding affinity 
for a target can be achieved. - (See, -e.g., Olivera, B. 
K. r et al. " (1990) science aii: 257-263.-. 

other forms of post-translational structure 
modifications include introducing factors that non- 
covalently influence tertiary structure of the nascent 
polypeptide. In particular, metal ions such as Ca", 
Mg", Mn~, Zn~, Fe~ Fe~, Cu~ and Mo 6 * can affect 
polypeptide folding configuration by forming 
coordination complexes with amino acid side chains. 
Similarly organic compounds such as nicotinamide 
nucleotides, flavine nucleotides, porphyrins, thiamine 
phosphates, serotonin, and the like, including 
inhibitors, agonists and antagonists of known 
biological functions, can interact with the nascent 
polypeptide to modify its 3 -dimensional folded 
configuration. As thus modified, the nascent 
polypeptide can exhibit different binding properties 
than an unmodified polypeptide. The use of such 
conf igurational modifiers enhances the range of 
potential binding activities of any candidate mixture 
of polypeptides. Also, it affords a means for 
selecting polypeptides having conditionally reversible 
functions, i.e., capable of being functionally "off or 
"on"", depending on the presence or absence of the 
modifier. Conf igurational modifiers need not be 
naturally-occurring compounds. The use of such 
modifiers during partitioning is only limited by the 
need to maintain stability of the ribosome complexes. 
Modifiers which disrupt ribosome complexes or which 
degrade the coding mRNA or nascent polypeptide should 
be avoided. A modifier can be included in the buffer 
or medium during partitioning. Alternatively, SPERT 
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. itself can be used to pre-select polypeptides which 
bind the modifier as a target, after which the candidate 
mixture of selected modifier-binding polypeptides can 
be further selected, via SPERT, for binding the . , 
5 ultimate target. * 

. • : Sequence variation in the test coding mRNA 

mixture can be achieved or increased, by mutation. For 
example, a procedure has been described for efficiently 
mutagenizing nucleic acid sequences during PCR 
10 amplification (Leung et al. 1989). This method or 
functionally equivalent methods can optionally be 
combined with amplification procedures in the present 
invention. 

Alternatively, conventional methods of DNA 
15 mutagenesis can be incorporated into the nucleic acid 
amplification procedure. Applicable Mutagenesis 
procedures include, among others, chemically induced 
mutagenesis and oligonucleotide site-directed 
mutagenesis. 

2 0 The starting mRNA mixture is not limited to 

sequences synthesized de novo . In particular, SPERT 
can be used to modify the function of existing 
proteins. A segment of the natural sequence is 
replaced by a corresponding segment of randomized 

25 sequence in the mRNA that encodes the protein. Since 
many known proteins belong to families having some 
sequences conserved and others varied, the logical 
approach is to replace the variable (or hypervariable) 
regions with randomized sequence, to maximize the 

30 chance of altering function. The proper choice of 

partitioning conditions, as will be apparent to those 
skilled in the art, results in selection for the 
desired functional variant. In this way, 
• modifications, alterations and improvements on known 

35 proteins can be achieved. 

To proceed to the amplification step when 
utilizing ribosome complexes, coding nucleic acids must 
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be released from the target-bound ribosome complexes 
after partitioning/ This process must be done without 
chemical degradation of the coding mRNA's and must 
result in amplifiable nucleic acids. In a specific 
embodiment, selected coding RNA molecules are eluted 
from a column.' using a high ionic strength buffer or 
other eluant'- capable of disrupting-, the -ligand- target 
bond. Alternatively, the ribosome can be denatured 
such that the mRNA is eluted. The coding mRNA can be 
removed from ribosome complexes or from ribosome 
complex-target pairs by phenol extraction or by phenol 
combined with a protein denaturing agent such as 7M 
urea. Although ribosomal RNA is also extracted, 
subsequent amplification is selective for the mRNA's 
because the primers used for cDNA synthesis and PCR 
amplification are complementary only to a conserved 
sequence in the mRNA's and not to ribosomal RNA. 

' As the translation of randomized mRNAs proceeds 
during the SPERT protocol, the growing polypeptide 
makes its way from the peptidyl transferase site within 
the large ribosome subunit toward the cytoplasmic 
solvent. The peptidyl transferase site is an intrinsic 
activity of the large ribosome subunit from all 
organisms; that site has been defined functionally but 
its precise location within the ribosome is unknown. 
However, the distance between that site and the 
cytoplasmic solvent also is known to be about 3 0 to 40 

amino acids in length. 

For optimal effectiveness in SPERT , the random 
portion of the nascent polypeptide (whose properties 
are selected during the procedure) should be "outside" 
the ribosome in order for partitioning of the ribosome 
complex to fully utilize the properties of the 
randomized polypeptide. A C-terminal trailer sequence 
is preferably incorporated into the translated 
polypeptide to insure that the randomized sequence is 
fully exposed after translation. From the work of 
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Smith.et. al, (PNAS, 25:5922, 1978); and Malkin and Rich 
(j. Mol. Biol., ££:329, 1967) for both prokaryotes and 
eukaryotes: about 30 to 40 amino acid residues remain 
within the ribosome during translation. Furthermore, 
if the amino-terminus of a growing polypeptide contains 
a hydrophobic domain of about 20 a amino acid residues, a 
nascent polypeptide of about 50 residues has been shown 
to be enough to allow the translation complex to 
interact with a membrane by hydrophobic interactions, 
see Kurzchalia et al, Nature 320:634, 1986). Thus, in 
those preferred embodiments of SPERT utilizing ribosome 
complexes, the randomized polypeptide will be encoded 
by randomized mRNA that is about 30-40 codons (that is, 
about 90-120 nucleotides) upstream from the codons at 
which the translation complex stalls. It will be 
understood that both longer and shorter C-terminal 
trailer sequences can be used effectively, and that 
SPERT, itself, can be used to determine optimum trailer 
length for a given partitioning system. The sequence 
of mRNA and encoded polypeptide in the C-terminal 
trailer can be designed to have any other desired 
function, such as more stability in the translation 
complex, ease of in vitro manipulation, subsequent 
polypeptide purification, as a reporter activity for 
diagnostics, cell intry, etc. 

Polypeptides selected by SPERT can be produced 
by any peptide synthetic method desired. Chemical 
synthesis can be accomplished since the amino acid 
sequence of the polypeptide is readily obtainable from 
the nucleotide sequence of the coding mRNA. Since cDNA 
from the coding mRNA is available, the polypeptide can 
also be made by expressing the cDNA in a suitable host 
cell . 

SPERT offers, as noted above, an opportunity to 
sample as many as 10 18 peptide sequences during a 
rigorous experiment with a particular target. As such 
SPERT may be compared with in vivo technologies aimed 
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at uncovering peptides with 'specif ic binding 
properties. These technologies, • lumped together under 
the name "phage display systems", have been available 
for more than five years (see, Smith, Science 22JB,:1315, 
1985) and widely appreciated in the last year (See, 
e.g., Charbit etal., EMBO J. 1:3029, 1986; Parmley et 
al., Gene 22:305/ 1988; "Scott et al. / Science 2A£:386, 
1990; Devlin et al., Science 249:404, 1990; Cwirla et 
al., Proc. Natl. Acad. Sci. 82:6378, 1990). Because 
phage display systems depend, in their present form, on 
a transformation step with either plasmid or phage DNA, 
the intrinsic depth of those systems is less than in 
SPERT. Phage display systems allow 10 9 different 
peptides to be searched easily, and perhaps 10 11 or so 
with bigger volumes and more difficulties. SPERT thus 
has a value for looking rigorously through large 
libraries . 

' Both SPERT, as defined thus far, and the phage 
display systems have a disadvantage in common, at least 
formally. In SPERT the peptide of interest is held by 
the ribosome, a machine that contains its own proteins 
and which is extremely large relative to the peptide of 
interest. Similarly, in the phage display systems the 
peptide of interest protrudes from a phage particle 
which is also relativeiy extremely large and which 
contains its own proteins. Although each of these 
systems will yield a peptide of interest with careful 
partitioning of the bound peptide from all other 
peptides bound to ribosomes or phage capsids, an 
improved system would provide the peptide of interest 
bound to an encoding nucleic acid (to achieve reverse 
translation) free of any other large, proteinaceous 
components. As described above, the large phage 
particle and the ribosome add limitations to these 
systems other than in the partitioning step of the 
process. The large entities also severely limit the 
number of random peptides that may be practically 
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generated and tested in the screening process. 

SPERT lends itself to such an improvement. . In 
an alternate embodiment, this invention contemplates a 
simple and general mechanism by which a non-random 
5 portion of each peptide within the collection of 

peptides becomes covalently or very tightly attached to 
' one end or the other of the mRNA that encodes it to 
form mRNA polypeptide copolymers. 

There are an almost unlimited number of 

10 specific systems that could be employed to generate 
mRNA • polypeptide copolymers. Any such system that 
allows the ribosomes in the translation mixture to have 
a high turnover can be useful. The An vitro reactions 
should be as free as possible from RNases. The RNAse 

15 problem may also be alleviated by usinc, amtant strains 
to lower RNase levels. Alternately, various techniques 
familiar to those skilled in the art are available for 
making the mRNA nuclease resistant. Additional 
criteria for effective systems for forming 

20 mRNA* polypeptide copolymers include the following: 1) 
the interactions between the nascent polypeptide and 
the mRNA must either occur before the ribosome complex 
is disrupted, or at a rate that highly favors the 
interaction over dissociation of the proximal species; 

25 2) additional reagents should be relatively small; and 
3) the reaction between the nascent polypeptide and the 
mRNA should be relatively efficient (i.e., at least 
about 5% or greater) . 

A nonlimiting catalog of methods that can be 

30 employed to generate mRNA •polypeptide copolymers will 
generally fall into the following categories: 1) 
Adapted post-translational modification systems; 2) 
Activation of the 5* end of the mRNA species and the N- 
- terminus of the peptides to promote relatively simple 

35 organic chemical type reactions between the species; 3) 
Attachment of the peptide to the mRNA prior to the 
onset of translation; and 4) tRNA crosslinking of the 
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nascent 'polypeptide and the VBBK.u Various embodiments 
of each of these systems is described below. The 
design of additional embodiments of these general 
systems would also be obvious to those skilled in the 
5 art. ■ 

pnst. Transl'ationai Modifier ^"* Systems 

In one embodiment the collection of mRNAs used 
in SPERT is synthesized using T7 RNA polymerase and 5- 
10 guanosine phosphono monothioate for initiation (see, 

Burgin et al. , EMBO J. 2:4111, 1990, for example), the 
monothioate is incorporated only at the 5' end; 
nucleoside triphosphates are the source of all internal 
residues during transcription. Organic tags may be 
attached to the 5' end without difficulty, and without 
harming the RNA for other functions. 'Thus each mRNA in 
the collection could have, for example, biotin or any 
one of a number of small reagents affixed to the 5-' end 
of the RNA. Alternatively, mononucleotides labeled 
20 with biotin could be used to initiate transcription. 
The 5' end of the RNA would certainly not preclude 
translation by bacterial ribosomes, since those 
ribosomes are indifferent to the chemical nature of the 
5' end as long as enough nucleotides are present 
25 upstream of the initiating AUG and as long as those 
nucleotides contain appropriate sequences to cause 

initiation to occur. 

According to this embodiment, the codons 
downstream from the AUG, also fixed, encode a peptide 
30 that has an extremely high affinity for or can be 

covalently bound to the chemical adduct positioned at 
— the-5' end of - each ~mRNAv Known peptide -sequences (such 
"as avidin) might "be"us"ed if biotin were 'the chosen 5' 
• tag. In one example, a biotin ligase may be used to 
35 make covalent the interaction between the peptide and 
the biotin at the 5' end of the mRNA. See, Cronan, 
(Cell, 58:427, 1989); Reed and Cronan (J. Bio. Chem., 
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2i£:11425, 1991) incorporated. herein by reference. 
Many suitable pairs of chemical adducts and fixed 
peptide sequences have been identified, and are known 
to those skilled in the art. For example, certain 
5 polypeptides contain lipoylat ion sites, and the post- 
.translation. modification would utilize the lipoylation 
system; See, Rucker e£_al. , (FASEB J. , 2:2252-61, 
1988); Ali et al . . (Mol. Microbiol. 4:943-50, 1990). 
For other post-translational modification systems, see, 

10 PCT Patent Application PCT/US90/02852 (published 

November 29, 1990, W0 90/14431). 

As the nascent peptide emerges from the 
ribosome, the most likely 5' adduct to be bound by that 
peptide sequence will be the 5 1 adduct on the mRNA 

15 encoding that exact peptide (which will include, in 

this case, randomized peptide sequences downstream of 
the fixed peptide adjacent to the initiating 
methionine) . Again, with respect to biotin' and biotin 
ligase, the first collisions will be irreversibly 

2 0 fixed. The length of the 5 1 end of each mRNA (that is, 

how many nucleotides upstream of the ribosome binding 
site are needed to enhance the binding reaction in cis) 
and the concentration of ribosomes that allow 
collisions between the nascent peptide of one ribosome 

25 and the 5 1 end of the mRNA of another can be determined 
easily without undue experimentation. This last point 
is clear from a simple calculation. Ribosomes are 
about 200 angstroms in diameter, so it may be assumed 
that the distance between the nascent, emergent peptide 

30 (from the large ribosome subunit) and the emergent 5 1 

adduct of the mRNA (from the small ribosome subunit) 
will never be more than 500 angstroms apart and could 
be much less. The calculated concentration of the 
nascent peptide with respect to its own 5 1 adduct in 

35 cis is higher than 3 micromolar for a worst case 
scenario, and could be more than 100 times higher. 
Since the ribosome concentration in many cell-free 
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translation experiments is sub-micromolar, it is not 
difficult to preclude scrambled binding between nascent 
peptides and 5- mRNA adducts on other ribosomes. 

As translation ends, after mRNA polypeptide 
copolymer formation , and prior to enrichment for 
peptides that partition with a target', -the- cell-free 
reaction may be treated with puromycin and EDTA to 
disassociate the ribosomal subunits. ATA, poly U, or 
other non-amplif iable RNAs may be added to prevent 
rebinding of mRNAs to the ribosomes. Size 
fractionation may then be used to enrich for small 
material, and/or high speed centrifugation would 
eliminate the ribosomes and many of the proteins from 
the cell-free system from the mRNA • polypeptide 
copolymer (such copolymers may be truly covalent or 
merely effective copolymers when very' high affinities 
are used for the linkage) . More complete purifications 
of the copolymer prior to partitioning with target are 
obvious. For example, hybridization to column-bound 
complementary DNA (to one end of the mRNA) and 
subsequent elution would give full purification. 
Similarly, the fixed peptide could include an 
additional sequence for this purification; a small 
epitope would do, thus allowing purification of the 
mRNA-polypeptide copolymer with antibodies against that 
epitope. 

The mRNA • polypeptide copolymer is partitioned 
as in the ribosome complex examples, and the bound mRNA 
amplified via cDNA synthesis and PCR, as always 
extending the cDNA to create again the T7 promoter 
sequence for the next round of SPERT. The peptide 
attached to the 5' end of the mRNA may cause the 3' end 
of the cDNA to be a bit shorter than in the absence of 
- peptide, but PCR easily accomplishes the full 
restructuring of the DNA for subsequent transcription, 
in this case initiated once again by phosphono 
monothioate nucleotide for adding the small organic 
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molecule* needed for linkage. • 

In this alternate embodiment of SPERT, the 
peptide is directly linked to the encoding nucleic acid 
and is partitioned to target (or reacted in any other 
way described for SPERT) with only the encoding nucleic 
acid availabler. (along- with ; the peptide^ collection) for 
that target!. The very large ribosome or phage capsid 
no longer obscures the partitioning reaction in any 
way. 

Estivation of 5' end of mRNA and N-terminus of peptide. 

The post-translational modification systems 
described above generally require an enzyme to 
facilitate the reaction between the nascent peptide and 
the mRNA. According to this embodiment, the modifying 
enzyme is eliminated, and relatively simple chemical 
reactions are relied on to form the copolymers. 

In one embodiment of this system, sOlf ur-halide 
chemistry is employed. Sulfur may be incorporated on 
the 5 1 end of the mRNA using the T7 RNA polymerase and 
monothiate for initiation as described above. A halide 
can be incorporated on the N-terminus of the peptide by 
use of N-haloacetyl-met-tRNA fmet (Pellegrini et al. , 
(Proc. Natl. Acad. Sci. USA, 69:83741, 1972); Sopari, 
et al ., (Biochemistry, 13:5432-39, 1976)). This 
combination would result in spontaneous nucleophilic 
substitution to form a thioether linkage between the 
nascent polypeptide and the mRNA. In order to avoid 
reaction of the halo-acetyl group with DTT in the 
translation mixture, or with cysteine residues in 
ribosomal proteins, it is preferred that the chloro 
acetyl functionality be utilized. 

In a further embodiment of this process, it may 
be desirable to accelerate the reaction between the 
nascent polypeptide and the mRNA by introducing a 
"chaperone" RNA sequence. The chaperone acts as a 
catalyst to facilitate the nucleophilic substitution 
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taction." An appropriate chaperone sequence, »ay. be 
ea sily selected by one skilled in the art ^"J"*** 
SELEX technology. A useful chaperone may be selected 
by placing a stretch of random noncoding RNA adjacent 
the 5' GMPS mRNA, and collecting those sequences ■ 
capabie-of :i r^cting with the halo-acetyl N-termmal 
polypeptide. This reaction "could be further , 
facilitated by selecting fixed amino acids at the N- 
terminal end that would present a probable nucleic acid 
interaction site. In further embodiments, the 
chaperone could be an RNA or protein acting as a true 
catalyst to facilitate the reaction. 

rr ir nna of mPN* to Peptjde A 

In one embodiment of the formation of 
^•polypeptide copolymers, the mRNA may be coupled to 
the nascent polypeptide before translation is 
initiated, in one embodiment, this pre-translational 
coupling would occur by attaching the 5' end of the 
mRNA to W-amino group of methionine on met-tRNA 
via a covalent linker. As translation proceeds, the 
initiating methionine is already attached to the mRNA 
at the initial amino acid sequence. 

| 

tm o^ s i inki m " f Mocsagp and repute . 

According to this embodiment, a covalent 
linkage is created between peptidyl-tRNA and mRNA. A 
specific embodiment of this system is based on studies 
of the photoreaction between the -Y- base of yeast 
tRNA* e and mRNA. See, Matzke et^l. , (Proc. Natl. 
Ld. sci. USA, 77:5110-14, 1980). See also, Stemer 
et al., (Nucl. Acids. Res. 12:8181-91, 1984) 
"Titration that tRNA can undergo peptidyl transfer 
and translocate normally from A-site to P-site after 
being crosslinked to mRNA); Paszyc et_ai- (Nucl. Acid. 
Reg. 6:385-97, 1979). A nonsense suppressor containing 
the Y base may be used that will crosslink to the 
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^nral amino acids used to construct modern proteins, 
■r^Tof possible seances of protein^ o* average 
size is 20- or ~10< 00 . Estimates of the number of 
Nicies in the universe are in the range io". whUe 

IsSates for the" number of ^T^Tx*? 
th. entire history of the earth are m.the range 10 . 
The tiny fraction of so-called sequence space that has 
Tne tiny result of evolutionary 

been explored by biology is a result 

history and the relatively short age of the earth. The 
present invention provides the means to explore protein 
sequence space without historical and evolutionary 
ligation!, while continuing to respect limitations 

stablished b y the number of particles in the universe. 
Th e invention provides the means to identify and 
isolate polypeptide ligands with any desired quality 
fr om vast mixtures of protein sequences 

argely of individual entities that have never before 
existel The amino acid sequence of the selected 
Ugand can be learned from the nucleotide sequence of 
it! encoding mRNA, making tedious amino acid sequence 
determination unnecessary. 

Even where the binding functions selected by 
SPERT have known naturally occurring counterparts, 
there is no reason to expect that the polypeptides 
selected by SPERT will resemble naturally-occurring 

0 :iins or peptides having similar function^ In-st 
instances, SPERT-selected polypeptides will be smaller 
than naturally-occurring proteins typically hav.ng a 
size of from 4-100 amino acids, preferably from 4-50 
amino acids selected from randomized sequence of the 
lame length, and also having a C-terminal trailer 
about 30-40 amino acids and, optionally a N-terminal 

eader of about 10 amino acids, for a total ength of 
about 100 amino acids, corresponding to a molecular 
weight of about llkd. This is smaller than most 
enzymes and all antibodies, for comparison, IgG has 
n olecular weight of about 150kd. Furthermore, many 
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"polypeptide ligands of , the invention will; function when 
freed by N- and C- terminal trailers* Therefore, the 
final product can be as small as 4-50 amino acids.. The 
polypeptides of the invention are non-naturally- 
5 occurring,: and typically differ in amino acid sequence 
and molecular "size from naturally-occurring proteins. 
That portion of the amino acid sequence arising from . 
randomized coding is designated the "binding segment" 
herein. The binding segment can be of any length, 

10 conveniently ranging from about 4-100 amino acids in 
length, preferably from about 15-50. amino acids in 
length. Additionally, given the vastness of sequence 
space, it is expected that most polypeptide ligands of 
the invention will have less than 50% homology with 

15 natural proteins, and preferably less than 3 0% amino 
acid homology with natural proteins. 

A polypeptide ligand of the invention in a 
number of ways functionally resembles an antibody; 
Polypeptide ligands which have binding functions 

2 0 similar to those of antibodies can be isolated by the 
methods of the present invention. Such polypeptides 
are generally useful in applications in which 
polyclonal or monoclonal antibodies have found 
application.. However, the polypeptide ligands of the 

25 invention have significant advantages over antibodies: 
they can be selected for any desired affinity, 
including higher affinities than are obtainable with 
antibodies, they can be selected to bind at any desired 
epitope or combination of epitomes, including binding 

30 sites not recognized by antibodies, they can be larger 
or smaller and have different solubility properties 
than antibodies and they can be generated by techniques 
that operate entirely in vitro , without the need for 
live animals or cell culture techniques. Applications 

35 of polypeptide ligands include the specific, 

qualitative or quantitative detection of target 
molecules from any source; purification of target 
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s , ZZe* are prefers proteins, but -so include 
among others carbohydrates, nucleic acids, . 
i lettldoglycans and a variety of srnaU molecules 
' V :1 ' P „lfh conventional antibodies, polypeptide ligands an 

„e employed to target biological structures, such as 
10 cel surLces or viruses, through specific interacts 
10 Tith a molecule that is an integral part of that 
b iological structure. Polypeptide ligands are 
advantageous in that they are not lifted by self 
tolerance, as are conventional antxbod.es. Also as 
15 noted, polypeptide ligands of the invention do not 
1 require animals or cell cultures for synthesis or 

Auction, since SPERT is a wholly ia B» P™~- 
Le methods of the present invention 

of polypeptide ligands can generate novel polypeptides 
,„ ^afolnd targets for which other proteinaceous Uganda 
are Mown- For example, a number of proteins are Known 
are Known. . * nuc leic acid sequences, such 

to function via binding to nucieio a 
as regulatory proteins which bind to nucleic acid 
operator sequences. The Known ability of certain 
„ nucleic acid binding proteins to bind to their natural 
site for example, has been employed in the detection, 
quantitation . isolation and purification of such 
Steins. The methods of the present invention related 
P t L use of polypeptide ligands can be used to maKe 
novel nucleic acid binding ligands having affinity for 
30 "acid sequences which are Known to bind proteins 
and to nucleic acid sequences not Known to bind 

r teins. Hovel, non-naturally-occurring 
which bind to the same binding sites of nucleic acids 
35 can be developed using SPERT. As will be discussed 

beiow, certain poiypeptides isolatable by SPERT can 
TL e employed to affect the function, (for example 
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inhiiiit, ' enhance or activate) specific target- molecules 
or structures. Specifically, polypeptide ligands can 
be employed to inhibit, enhance or activate the 
function of proteins and of nucleic acids. 

It is' a second important , aspect # of the present 
invention - that the methods , described .herein. t can be 
employed to identify, isolate or produce polypeptide 
molecules which will bind specifically to a particular 
target molecule and affect the function of that 
molecule. In this aspect, the target molecules are 
again preferably proteins or nucleic acids, but can 
also include, among others, carbohydrates and various 
small molecules to which specific polypeptide binding 
can be achieved. Polypeptide ligands that bind to 
small molecules can affect their function by 
sequestering them or by preventing them from 
interacting with their natural ligands. For example, 
the activity of an enzyme can be affected by a 
polypeptide ligand that binds the enzyme's substrate. 
Polypeptide ligands of small molecules are particularly 
useful as reagents for diagnostic tests, or other 
quantitative assays. For example, the presence of 
controlled substances, bound metabolites or abnormal 
quantities of normal ^metabolites can be detected and 
measured using polypeptide ligands of the invention. 
Antibodies to polypeptide ligands can be used to 
precipitate or bind ligand-target pairs to a solid 
phase matrix in a diagnostic assay. A polypeptide 
ligand having catalytic activity can affect the 
function of a small molecule, by catalyzing a chemical 
change in the target. The range of possible catalytic 
activities is at least as broad as that displayed by 
natural proteins. 

The strategy of selecting a ligand for a 
transition state analog of a desired reaction is one 
method by which catalytic polypeptide ligands can be 
selected. Polypeptide ligands with high affinity for 
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transition-state analogues.are' liXely. tohave .enzymatic 
activity, as has been demonstrated for monodon. . 
bodies directed against transition-state analogues. 
These antibodies have exhibited a wide range of _ 
Catalytic activities, .including acyl-transfer reactions 
et-al., science ^-.WOiXlMU Tramant.no et 
'l science 221=1570 (1986); Jacobs etal., J. »• 

cnem Soc. i^:™* (!»«>» ««« et 3l - SClenC \ nB1 
T. 041 (1-7,; .anda et al.. Science m^e <» > ' 
■Suit*. P."- Science 21^2* (».•» BenXoviC et al. , 
pfoc. Hatl. Acad, sci. (»••)!. carbon-carbon 

bond formation [JacKson et al., J- »• Ch ^- S0C - 
£.,4.41 (1988); Hilvert and »area. J. Am. chem. Soc 
^ 5593 1988)], carbon-carbon bond cleaving reactions 
£S « al., ». Am. SOC. ^=7888 (1988), 

peptide cleavage [Iverson and Lerner, Scxence 2^.1184 
, 19B9)] , and ester bond hydrolysis (Janda et al 

cience 244,437 (».»))• The ~ ° £ TTbv 1 PBKT 
sequences and structures that can be explored by SPERT 
far exceed those available in the immune system. 

Enzymes are evolved using SPERT and startup 
randomized sequences corresponding to about 50 amino 
ITZ as in Sample 3. Enzymatic polypeptide ligands 
o( lall size are entirely unanticipated by the present 
understanding of enzymology; enzymes are always much 
larger in nature than the scientist expects. The 
specific transition state analogues used are drawn from 
the literature cited above. Among the reactions probed 
by the monoclonal antibody-enzymes are some which lead 
to the breakdown of toxic waste products, mdudrng 
Thecals with chlorine-carbon bonds and carbon-carbon 
bonds in ring structures li*e those found in benzene 

and polychlorinated phenols. 

The binding selection methods of the present 

invention can be coined with secondary selection or 
screening to identify ligands capable of modifyxng 
target molecule function upon binding. The large 
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population of variant amino acid, sequences that can be 
tested by SPERT enhances the probability that 
polypeptide sequences can be found that have a desired 
binding capability and that function to modify target 
molecule activity. The methods of the present 
invention .are .useful .for ^electing ^polypeptide ligands 
which can selectively affect function of any target 
protein- The methods described herein can be employed 
to isolate or produce polypeptide ligands which bind to 
and modify the function of any protein or nucleic acid. 
It is contemplated that the method of the present 
invention can be employed to identify, isolate or 
produce polypeptide molecules which will affect 
catalytic activity of target enzymes,, i.e., inhibit 
catalysis or modify substrate binding, affect the 
functionality of protein receptors, i.'e., inhibit 
binding to receptors or modify the specificity of 
binding to receptors; affect the formation of protein 
multimers, i.e., disrupt quaternary structure of 
protein subunits; and modify transport properties of 
protein, i.e., disrupt transport of small molecules or 
ions by proteins. 

Secondary selection methods that can be 
combined with SPERT include among others selections or 
screens for enzyme inhibition, alteration of substrate 
binding, loss of functionality, disruption of 
structure, etc. Those of ordinary skill in the art are 
able to select among various alternatives those 
selection or screening methods that aire compatible with 
the methods described herein. 

An embodiment of the present invention, which 
is particularly useful for identifying or isolating 
polypeptides which bind to a particular functional or 
active site in a protein, or other target molecule, 
employs a molecule known, or selected, for binding to a 
desired site within the target protein to direct the 
selection/amplification process to a subset of 
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polypeptide' ligands that bind + at or near the desired 
site within the- target molecule. In a simple example, 
a polypeptide sequence known to bind to a desired site 
in a target molecule is incorporated near the 
randomized region of all polypeptides being tested for 
binding^ * SPERT -is then -used'~tb select- thosevyariants , , 
all'of which will contain 'the known -binding sequence, 
which bind most strongly to the target molecule. A 
longer binding sequence, which is anticipated to either 
bind more strongly to the target molecule or more 
specifically to the target can thus be selected. The 
longer binding sequence can then be introduced near the 
randomized region of the polypeptide test mixture and 
the selection/amplification steps repeated to select an 
even longer binding sequence. Iteration of these steps 
(i.e., incorporation of selected sequence into test 
mixtures followed by selection/amplification for 
improved or more specific binding) can be repeated 
until a desired level of binding strength or 
specificity is achieved. This iterative -walking" 
procedure allows the selection of polypeptides highly 
specific for a particular target molecule or site 
within a target molecule. Another embodiment of such 
an iterative "walking" procedure, employs an "anchor- 
molecule which is not necessarily a polypeptide or 
amino acid. In this embodiment a molecule which binds 
to a desired target, for example a substrate or 
inhibitor of a target enzyme, is chemically modified 
such that it can be covalently linked to a bridge 
molecule which in turn is known to be bound to an 
oligopeptide of known sequence. The bridge molecule 
covalently linked to the -anchor" molecule that binds 
to the target also binds to the target molecule. The 
' sequence encoding the known bridge-binding oligopeptide 
is incorporated near the randomized region of the test 
nucleic acid mixture. SPERT is then performed to 
select for those polypeptide sequences that bind most 
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strongly to the target 'molecule/bridge/ anchor complex. 
The iterative walking procedure can then be employed to 
select or produce longer and longer polypeptide 
molecules with enhanced strength of binding or 
5 specificity of binding to the target. The use of the 
"anchor" - procedure -is expected to allow* more-* rapid 
isolation of -polypeptide ligands that bind at or near a 
desired site within a target molecule. In particular, 
it is expected that the "anchor" method in combination 

10 with iterative "walking" procedures will result in 

polypeptides which are highly specific inhibitors of 
protein function. 

In accordance with the teachings of copending 
applications Serial No. 07/536,428 and Serial No, 

15 07/714,131, the translated mRNA of a ribosome complex 

or mRNA • polypeptide copolymer is, in principle, capable 
of binding to target molecules and of being partitioned 
concurrently with* nascent polypeptides. In particular, 
where partitioning is accomplished by affinity 

20 chromatography, the selected ligand can be an RNA, 
rather than a polypeptide. Binding of mRNA can be 
differentiated from polypeptide binding once the ligand 
has been selected and both the selected polypeptide and 
its coding mRNA are available for independent direct 

25 binding studies where the two are not part of the same 
ribosome complex. Comparative studies of the relative 
frequency of RNA ligands and polypeptide ligands 
selected by SPERT are of fundamental biological 
importance to understanding the specialization of 

30 function that currently exists in living cells. This 

direct comparison between RNA and peptide during the 
SPERT cycles may prove to be surprisingly robust. As 
described in the SELEX applications, large numbers of 
v protein targets will yield. a tight-binding RNA ligand. 

35 For a given target it can not be predicted whether RNA 
or peptide will give more useful ligand solutions, and 
thus SPERT can be seen as an improvement to the SELEX 
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application because when RNA . yields' the best .ligand 
solutions the data will lead to that conclusion 
immediately. For example, the RNA ligand solutions 
will be indifferent to the reading frame in which the 
conserved RNA sequence or structure is found, while the 
peptide solutions will force the : RNA solutions to have 
a conunon sequence in the same -.reading frame.. . 

The polypeptides of the invention can be 
selected for other properties in addition to binding. 
For example, during partitioning, stability to certain 
conditions of the desired working environment of the 
end product can be included as a selection criterion. 
If a polypeptide which is stable in the presence of a 
certain protease is desired, that protease can be part 
of the buffer medium used during partitioning. As will 
be understood, when utilizing ribosome complexes 
conditions which disrupt ribosome complexes should be 
avoided. Other desired properties can be incorporated, 
directly into the polypeptide sequence as will be 
understood by those skilled in the art. For example, 
membrane affinity can be included as a property, either 
by employing a N- or c-terminal trailer having high 
hydrophobicity, or by biasing the randomized coding to 
favor the amino acids with lipophilic side chains. 

The coding nucleic acid concomitantly selected 
by partitioning nascent polypeptides as described, is 
useful in its own right to transform host cells or 
organisms. The transformed organism is then useful 
for, e.g., fermentation production of the selected 
polypeptide. A transgenic organism can be rendered 
resistent to a virus infection, for example, by causing 
in vivo synthesis of a polypeptide ligand of the viral 
nucleic acid or a key viral protein. In principle, any 
functionality contributed by a polypeptide ligand of 
the invention can be bestowed on a suitable host 
organism. Methods known in the art can be used to 
combine the coding region with a promoter, 



Ji polyadehylat'ion. signal; functional in, the intended host, 
followed by incorporation into a. suitable vector for 
transformation, all as known and understood in the art. 

5 EXAMPLES ' 

( , /• " The techniques and methods used in /the ensuing 
examples are published and known in the art. Together 
with adaptations and modifications known to those of 
ordinary skill in the art, the procedures not 

10 specifically referenced herein are available from known 
reference works. In addition to Sambrook et al., 
(1989) supra , Genetic Engineering . Plenum Press, New 
York (1979); Weir, (ed.) (1986) Handbook of 
Experimental Immunology in Four Volumes . 4th Ed, 

15 Blackwell Scientific Publications, Oxford; and the 

multivolume Methods in Enzvmologv published by Academic 
Press, New York. Polymerase chain reaction techniques 
are described in PCR Protocols (Michael A. 'Innis, *et 
al. eds.) (1990) Academic Press, Inc. 

20 Throughout examples 1-9, reference is made to 

Tables 1 and 2. Table 1 lists oligonucleotide 
sequences used for preparing mRNA candidates. Table 2 
lists the same sequences together with explanatory 
notes showing functional domains. Sequences in 

25 capitals are chemically synthesized, sequences in lower 
case letters are complementary sequences made 
enzymatically by DNA polymerase. The Examples could be 
adapted by those of ordinary skill in the art to 
generate mRNA • polypeptide copolymers as taught herein 

30 without undue experimentation. 
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Example 1. Direct Immunoprecipitation of Ribosome 

Complexes: Polypeptide Ligands Directed 
Toward Immunoglobulin Molecules. 

The method of the invention is used to select 
novel polypeptides that bind the antibody of an epitope 
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-commonly" recogni Z ed;brthe ant isera, from auto.i^une 
mi ce which are the. fl progeny of a cross of NZB and NZW 
parents (Portanova et al., J. Immunol. 144, 4633 
(1990). The known epitope consists of about 10 . 
contiguous amino acids at the amino terminus of the 
nistone H2B protein. To make. mRNA encoding candidate 
polypeptides, a 5- fixed sequence composed of a T7 
promoter sequence and a ribosome binding site which is 
recognized by both prokaryotic and eukaryotic 
ribosomes, terminating in a restriction endonuclease 
site is synthesized and cloned using oligonucleotides 
having the sequences shown as sequence 1 in Tables 1 
and 2 and in Figure 8. A3' fixed sequence is placed 
into a restriction site to provide an mRNA encoding the 
C-terminal trailer sequence of ca. 100 nucleotides 
lacking stop codons (for ca. 30-35 amino acids) shown 
as sequence 3 in Tables 1 and 2 and Figure 8. In 
addition, as shown in Figure 1,-a 3- primer annealing 
site (sequence 3) is provided so that cDNA synthesis 
can be accomplished on the mRNA recovered from 
partitioned ribosome complexes. 

The randomized polypeptide insertion site is 
bounded by restriction endonuclease recognition sites, 
in this example EcoRI and Pstl. A single-stranded 
oligonucleotide is synthesized with a randomized 
sequence of 45 nucleotides (corresponding to 15 codons) 
bounded by specific sequences that include those two 
restriction endonuclease sites (Sequence 4a) . 
Synthesis of randomized oligonucleotides is carried out 
using an Applied Biosystems DNA synthesizer provided 
with a reactant mixture for each nucleotide position. 
To partially compensate for the amino acid sequence 
bias inherent in the redundancy of the genetic code, 
the reaction mixtures contain, on a mole percent basis, 
the following composition of bases for each codon: 
First position, C-20%, T, A, and G-30% each; Second 
position, C-15%, A-35%, T and G-25% each; Third 
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position, -T, < C, A and ;G-25% r:each.„ Using nucleic acid 
primer that is complementary to the fixed.. 3' end of the 
randomized oligonucleotide, randomized double-stranded 
DNA is created with the action of DNA polymerase. The 
products are- digested with, the .two restriction 
-endonucleases and ligated between the 5 V fixed sequence 
and the'3» 'fixed sequence discussed above. In vitro 
transcription of these ligated templates using T7 RNA 
polymerase (Bethesda Research Laboratories, 
Gaithersburg, MD) provides mRNA templates for in vitro 
translation. A rabbit reticulocyte lysate system (BRL) 
is used to translate the mRNA templates in vitro / using 
standard reaction conditions. Such translation of 
these transcripts results in a variety of ribosomal 
complexes (mRNA-na scent polypeptide-tRNA-ribosomes) 
that are identical except for the randomized region of 
the nascent polypeptide. 

/ Antibodies. (IgGs) , Portamova et al . / supra > 
which recognize the H2B histone epitope are added to 
the in vitro translation mixture. Immunoprecipitation 
of the immunoreactive ribosome complexes partitions the 
mRNAs species that encode the highest-affinity 
polypeptide ligands in the population (see Figures 3 
and 4) . Immunoprecipitated complexes are separated by 
low speed centrifugation. cDNA is synthesized from 
these mRNAs and is used via PGR to provide template for 
further cycles of transcription, translation, 
immunoselection and cDNA synthesis. 

Clones are isolated as described in Application 
07/536,428, June 11, 1990, incorporated herein by 
reference. The individual polypeptide products are 
over- produced and purified and tested, using standard 
techniques for reactivity to the anti-H2B histone 
antibodies. In addition, the polypeptide ligands are 
challenged competitively with authentic histone H2B- 
derived epitomes to discover which polypeptide ligands 
bind to the same portion of the antibodies as the true 
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lepitope.'' Among the polypeptides', isolated .that bind the 
antibody are found those having less than 50% sequence 
homology with the H2B histone epitope. Other antibody 
binding sequences are identified having less than ,30% 
homology with the H2B histone epitope. Other 
polypeptide ligands . of the rantibody.jdo, not compete for 
the H2B epitope binding site. • ' ' - ' 

Example 2. Diagnostics using the polypeptide ligands 
of Example 1: An assay for anti-H2B 
antibodies in the progeny of NZB X NZW 
mice. 

Auto- immune diseases result from the 
elaboration of an inappropriate antibody molecule with 
reactivity toward a normal cellular component (often a 
protein, but sometimes a nucleic acid; as in Systemic 
Lupus Erythematosis - SLE) . Polypeptide ligands 
generated through the SPERT protocols in Example 1 are 
aimed at diagnosis of mouse "Lupus" in the offspring of 
NZB X NZW mice. SPERT is used to identify and obtain a 
reagent ligand for the diagnostic recognition of the 
auto-antibody that recognizes the histone H2B epitope. 

As in Example 1, ribosome complexes are treated 
with the auto-antibody to partition reactive 
polypeptides from non-reactive polypeptides resident 
(as nascent polypeptides) in ribosome complexes. The 
auto-antibodies are used to precipitate the ribosome 
complexes containing polypeptides that fit into the 
active site of the antibody. The most avidly bound 
polypeptide emerges from repeated SPERT cycles. 

The most avidly bound polypeptide ligand does 
not resemble in detail the epitope identified as the 
portion of the target that reacts with the antibody. 
VAuto-immune diseases are triggered by unknown antigens, 
which are not necessarily the same as the 
target/epitope identified as the interactive species 
during the clinical stage of the auto-immune disease. 



BNSOOCID; <WO 9303172A1_L> 




For Example, ..a virus . infection may trigger an immune 
reaction that yields a class of antibodies that cross- 
react with a normal cellular target. Such antibodies 
may bind more avidly to the original, stimulatory, 
5 viral antigen than. to, the epitope on the cellular 
- - ' target. As another .example, the, epitope on the 

-cellular target may not take full advantage of the 
binding site on the antibody. 

The polypeptide ligand is used diagnostically 
10 to measure the quantity of circulating auto-antibody, 
using, e.g., an ELISA assay. The technology is 
available to one skilled in the art, without undue 
experimentation. As another example, the fixed portion 
of the polypeptide ligand is used as the reporter 
15 substance when the polypeptide ligand interacts with 
the circulating auto-antibody . With a fixed carboxy- 
terminus of beta-galactosidase or alkaline phosphatase, 
serum 'protein samples attached to plastic plates are 
assayed directly for the anti-H2B antibody by 
20 "staining" with the polypeptide ligand covalently fused 

(by recombinant DNA techniques) to either reporter 
enzyme. 

Example 3. Indirect Immunoprecipitation: Polypeptide 
25 ligands directed toward domains of any 

protein. 

Immunization of animals with antigens, whether 
crudely prepared or purified, often results in immune 
30 responses directed at a subset of the available 

epitomes in that antigen. The polyclonal sera may 
react largely with a single protein domain in that 
antigen. Similarly, when researchers attempt to raise 
antibodies against fusion proteins, often the well- 
35 known fusion partner is immuno-dominant over the new 
protein portion of the fusion. 

Antibodies aimed at a protein target (but that 
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do nof recognize the portion of the" target that one 
wishes to use as the target in SPERT) allow INDIRECT 
Immunoprecipitation of ribosome complexes. That is, 
immunoprecipitation is a useful partitioning step, when 
antibodies are aimed at domains in the target that are 
different from^those domains pre-selected f or. SPERT- 
based ligand evolution. This protocol is sometimes 
called "panhandling", and can yield high-af f inity 
polypeptide ligands for target domains that are weakly 

immunogenic. 

SPERT is performed using variable material 
prepared as in Example 1 except that the randomized 
mRNA regions are now set to yield about 50 amino acids 
in the solvent-exposed nascent polypeptide. Biased 
randomization is done so that chain termination codons 
are not likely over the 150 randomized nucleotides; in 
addition, cell-free translation is performed in the 
presence of so-called suppressor tRNAs so that 
translation continues to the desired portion of the 

mRNAs. 

The population of ribosome complexes is pre- 
dated with the antisera aimed at the target protein, 
but in the absence of that target protein. The pre- 
treatment is designed to eliminate any nascent 
polypeptides that react directly with the antibodies, 
as in Example 1. The target protein is then added to 
the ribosome complexes, along with antibodies aimed at 
the target protein. Partitioning occurs as the 
ribosome complexes that interact with the target at the 

same time (see Figure 4). 

The single-stranded DNA binding protein of 
bacteriophage T4 (g P 32) has an acidic carboxyterminal 
region which is immunodominant (K. Krassa, Ph.D., 
Thesis, 1987). in one immunization experiment, 
polyclonal sera react exclusively with the 
carboxyterminal domain of the protein; 12 monoclonal 
cell lines derived from hybridoma fusions with spleen 



* cells from such immunized animals- produced antibodies 
that react with the same target domain. Purified 
polyclonal sera which react with the carboxy-terminal 
domain of gp32 are used for indirect 
immunoprecipitation. in this example. 

./>*;,- A population.^ of ribosome, complexes is produced 
(above) . These ribosome complexes are pre-treated with 
the polyclonal sera aimed at gp3 2; this is readily 
accomplished by passing the ribosome complexes through 
Staph A columns pre-bound with the polyclonal sera 
against gp32. Subsequently , those ribosome complexes 
unable to react directly with antibodies raised against 
gp32 are reacted with gp32, followed by treatment with 
the sera aimed at the carboxy-terminus of gp32. Goat 
anti-mouse antibodies are used to immunoprecipitate 
gp32 and whatever ribosomal complexes interact with the 
core domain of gp32. Cycles of SPERT are continued 
until a desired level of binding is attained. 
Sequences are then cloned and individuals identified 
and tested for affinity to gp32. 

Example 4. Isolation of a polypeptide ligand for a 
serine protease. 

Serine proteases are protein enzymes that 
catalyze hydrolysis of peptide bonds within proteins, 
often with high selectivity for specific protein 
targets (and, of course, for specific peptide bonds 
within the target protein) . The serine proteases are 
members of a gene family in mammals. Examples of 
serine proteases are tissue plasminogen activator, 
trypsin, elastase, chymotrypsin, thrombin, and plasmin. 
Many disease states can be treated with polypeptide 
ligands that bind to serine proteases, for example, 
disorders of blood clotting. Elastase inhibitors are 
likely to be useful in minimizing the clinical 
progression of emphysema. Proteases other than serine 
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proteases are^also important -mammalian biology, and 
these too are targets for polypeptide ligands with 
appropriate affinities obtained according to the 
invention herein taught. 

A ligand that binds to porcine elastase is 
identified and'purified using -the. starting randomized 
material of Example 3. Serine proteases are easily 
attached by standard methods to column support 
materials with retention of enzymatic ^tivity. 
Porcine elastase attached to agarose is available from 
commercial sources. Thus, in this example affinity 
chromatography is the partitioning method. Natural 
elastase inhibitors are available, and are used to 
check that the active site of the bound elastase is 
available for the binding of an inhibitory ligana. The 
buffer used for binding during the SPERT cycles must 
not denature or otherwise inactivate elastase; 
dithiothreitol, which can reduce protein disulfide 
bonds, is left out of the binding buffer. 

After several rounds of SPERT, as. the affinity 
of the mixture of nascent polypeptides becomes high, a 
reversal of the elution parameters is used. Early 
rounds of SPERT are aimed at obtaining any polypeptide 
ligand that binds to any domain of elastase ; after 
virtually all the nascent polypeptides are able to bind 
the column, the ribosome complexes are poured through a 
column that has been pre-saturated with a natural 
inhibitory ligand for the elastase active site. In 
addition, the elution buffer for this procedure 
includes high concentrations of that same natural 
inhibitory ligand. The ribosome complexes that are not 
bound in this reversed elution procedure are used to 
prepare mRNAs for further SPERT cycles, once again 
• depending on high affinity for the bound elastase. 
This procedure focuses the evolving polypeptide ligands 
toward the elastase active site. 

When the mixture of polypeptide ligands has a 
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r hi^h affinity >f or the^bound' elastase, and is aimed 
primarily toward the active site, further enrichment 
for high affinity inhibitors of elastase activity is 
accomplished by including low concentrations of the 
natural inhibitors .in , the partitioning steps , thus 

"demanding that' the evolving polypeptide, ligands have' 
higher affinity than the effective affinity of the 
natural inhibitor at the concentration used. 

Nucleic acids encoding polypeptide ligands are 
cloned and sequenced, and binding affinities and 
inhibitory binding affinities for elastase are 
measured. Binding affinities and inhibitory 
efficiencies are measured with the same polypeptide 
ligands for other members of the serine protease family 
in order to ascertain specificity within the family. 

Example 5. Polypeptide ligands that antagonize a 

receptor: A synthetic inhibitor of the 
interleukin-1 receptor. 

Receptors are a class of proteins that are 
partially integrated into the cell's cytoplasmic 
membrane such that a domain resides outside the cell. 
That domain serves as a binding site for cell extrinsic 
molecules, including growth factors, peptide hormones, 
non-peptide organic molecules (which may include 
hormones), or even ions. Receptors handle the bound 
ligand in several different ways, including signal 
transduction through the membrane or internalization of 
the bound ligand for its subsequent function. In 
either case polypeptide ligands of the invention may be 
used to affect function of the receptor, that is to 
cause the normal activity of the natural ligand or to 
block that activity. 

Receptor antagonism for a useful therapeutic 
purpose is accomplished by generating a polypeptide 
ligand through SPERT that is aimed at the interleukin-1 
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■ * a natural: antagonists the-,reoeptor 
(IL -i) receptor. * natural 9 ^336-340. 
ha s been found (Hannum *t ,«-,«. . 

~ ::::^ve - 

^ a ; i \^::" ^--.ptor.nt.aoni.t, 
antagonist (called IL Ira f ^ 

iS ^^ ^ ^ " ~ 
competitive inhibit or of antagonist, 

receptor. The natural IL-lra « P t 

completely without a g ons activ^ ^ ^ 

concentrations used m the wo ^ 
^esi.ed as a protein^ a*h» a ^ 
post-translational cleavage the lated . 

«, amino acids and, ^.^a, without 

ar-tivitv of recombinant 11. J-"/ 
However, the activity ac tivity of the 

glycosylate, is comparable to tn 

natural inhibitor. ? polype ptide ligand 

SPERT is used to devei P t> met hods 
• 4- for- the interleukin-1 receptor. Two m 

antagonist for the interx ant ibodies are 

are used, in the ^ ble t0 cross- 

raised against interleuKm-1 thai *~ 

^ -4.K TT-i-ra Such monoclonal anti»«« 
react with IL Ira. S b((tu>en ^ 

principle recognize the featur ^ as 

a„a IL-lra. Those »° n ° cl ° n ^"" Uganas tha t bind 
in Example L to develop P= ^ " ptlUtide ligands 
to the antigen combining site, antagonls ts. 

are candidates ^ a."™ / ^Provide antagonists 
Since one goal in this case is t ^p fl 
smaller than the natural IL ira t ^ 
polypeptide is ca. 50 amino ^ acic^ 

in a second methodology the extr 

• f the IM receptor is itself used as the 
domain of the IL 1 «c P through 
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a* 67 ' - 
* is >eluted with high concentrations of IL-1, thus 
' displacing the ribosome complexes and nascent 

polypeptides with the natural ligand known to bind to 
the desired active site on the receptor. Cycles of 
5 SPERT are continued until high affinity polypeptide 
ligands "are identified. ^ ' . .~ 

Very' high affinity, even covalent, antagonists, 
of the receptor are isolated by an elution protocol 
during SPERT that denatures the ribosome complexes even 
.0 if the polypeptide ligand remains strongly bound to the 
receptor. The mRNA eluted from the column under 
protein denaturing conditions is used to prepare cDNA 
which is amplified through PCR, after which 
transcription provides mRNA for the next round of 
15 SPERT. 

All genes encoding polypeptide 'ligands are 
sequenced, and the polypeptide ligands are tested for 
IL-1 receptor antagonism. Those ligands identified by 
receptor-based affinity chromatography are tested with 

20 the antibodies of the first method to screen for the 

novel antagonists recognized by those antibodies that 
recognize structural or sequence homology between IL-1 
and IL-lra. Novel, SPERT-generated polypeptide ligands 
having IL-1 receptor antagonist activity are isolated 

25 and characterized. SPERT-generated antagonists having 
less than 50% amino acid homology with natural IL-lra 
are identified. In addition, SPERT-generated 
antagonists having less than 30% amino acid homology 
are identified, 

30 

Example 6. Protein improvement by SPERT: Mutagenesis 
and selection of better natural 
insecticides. 
Bacillus thuriengiensis is a gram-positive, 
35 spore-forming bacteria which produces insecticidal 

proteins. These proteins, derived from different B. 
thuringiensis strains, have varying effectiveness for 
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Killing "insect larvae o£ different species Although 
Z specific protein will Kill larvae of. a 

variety of species, the effective-ess toward the 
Afferent insect targets Measured as the level of. 
protein required to produce 50*. mortality, can, vary by 
L much as 2 ooo-fold. The.-echahism of action .for. 

ese insecticide proteins is to hind, a rece pt or on the 
^t membranes of the susceptible msect larva Such 
Cranes serve as a functional partit.onm, tool a 

We create double-stranded DNA templates 
suitable for SPERT by PCR; the appropriate DNA encodes 
the N-terminal 646 amino acid portion of the 

. * *. c iiVic:necies kurstaki HD-1, 
insecticidal protein from t. subspecies 

. ,t,- ^vmf f et al.. Biotechnology 
which is fully active (Fischhoff et ai., 

5-807-813 (1987). This protein kills the larva of 
tcmaL hornworm and cabbage looper very effectively at 
low concentration. Substantially more protein is 
required to kill tobacco budvorm, corn earworm, black 
cJworm, European cornborer, and beet armyvorm Gut 
ro embranes from each of these insect larvae vill be used 
as partitioning agents in SPERT. 

The starting material in these experiments is 
RNA derived from the cloned gene, as above. Two 
me thods are used to create protein variants. In one 
m ethod mutagenic PCR provides random mutations 
thr oughout the 646 amino acids of the -secticide^ In 
fixed codons within the insecticide, using about 50 
Lno acid replacements. In particular, randomized DNA 
is used to replace the codons encoding the 
hyp ervariable region of the Bt. toxin. Rounds of SPERT 
are continued until a desired level of binding to gut 
Lbranes is achieved. The DNA products are cloned and 
fenced and individually assayed for effectiveness in 
lading membranes and larval killing. Effective toxins 
are selected by SPERT, having a naturally-occurring 
sequence replaced by a sequence that is less than 50, 
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■ >: * homologous with the* replaced • sequence. In addition, 

toxic, SPERT-generated variants are identified wherein 
the original, naturally-occurring sequence is replaced 
by a sequence having less than 30% sequence homology 

5 with the replaced sequence. 

Example 7. 1 Anti-viral polypeptide ligands: 

Inhibition of virai entry into target 
cells. 

10 Receptors are often used for viral attach on 

cells. Recently Kaner et al. (Science, 241:1410-1413 
(1990)) described the basic fibroblast growth factor 
(FGF) receptor as the likely portal through which 
Herpes Simplex Virus Type 1 (HSV) enters a cell. In 

15 that same paper, by citation of other work several 

other viruses are said to utilize other receptors to 
gain cellular entry. Rhinovirus, the common cold 
virus/ is said to enter cells through a cell adhesion 
molecule ICAM-1. HIV, the AIDS virus, enters cells 

20 through the CD4 glycoprotein receptor. Epstein-Barr 
virus enters T lymphocytes via the C3d complement 
receptor. Rabies virus enters nerve cells through the 
acetylcholine receptor. Reovirus enters cells through 
the beta-adrenergic rjeceptor. Vaccinia virus enters 

25 cells through a functional interaction with the 

epidermal growth factor receptor. Apparently viruses 
survive in part by using absolutely crucial cell 
receptors to gain entry into susceptible hosts- That 
is, host organisms can not easily alter such important 

30 receptors so as to become resistant to the virus 

without suffering some impairment of crucial cell and 
organism functions . 

Polypeptide ligands of the invention are 
identified that diminish viral uptake through receptors 

35 while still allowing critical growth factors to 
function. The basic FGF receptor is used to 
demonstrate a successful strategy. The soluble domain 
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of "the basic FGF receptor (Lee< et ali /■ Sciehce, -24*57 
(1989)) is used as the target. A candidate mixture of 
polypeptide ligands is used as in Example 3. The 
partitioning of ribosome complexes is obtained with 
matrix bound extracellular domain of the FGF receptor . 
The cycles of.SPERT.are altered to include an elution 
step from the matrix with high concentrations of HSV; 
during this elution step the ribosome complexes that 
exit the column are discarded, while those ribosome 
complexes that remain on the column are further eluted 
with high concentrations of FGF itself.. Those ribosome 
complexes that are not displaced by HSV but are 
displaced by FGF contain nascent polypeptides that are 
candidate ligands with the desired specificity. Such 
polypeptides bind FGF receptors in a way that inhibits 
HSV binding but does not interfere with FGF binding. 
Several cycles of SPERT are used to find the most 
avidly bound polypeptide that it eluted with FGF but 
not. with HSV. candidate polypeptides are assayed, for 
their negative impact on HSV infection and their 
inability to prevent FGF-mediated cell growth. The 
xnost useful polypeptide ligands in this example are 
neither antagonists nor agonists of the FGF receptor at 
concentrations that diminish HSV infection. Novel 
polypeptides meeting these criteria are made using the 
process as described. A polypeptide meeting the 
criteria having less than 50% amino acid homology with 
FGF is isolated. In addition, a polypeptide meeting 
the criteria having less than 30% homology with FGF is 
isolated. 

Example 8. Polypeptide ligands that enter cells: The 
glucocorticoid receptor and trojan horse 
ligands. 

The glucocorticoid receptor protein binds 
steroid hormone, after which the receptor protein is 
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internalized from-the- membrarie so that, the- receptor can 
make i€s way into the cell nucleus. The receptor has a 
DNA binding domain (DBD) that interacts in the nucleus 
with target DNA sequences. Polypeptide ligands of the 
invention, agonists of the glucocorticoid receptor, are 
irt£efnali2ed A al6ng*with' the .receptor , and thus^ directed 
sequentially to the cytoplasm and then to the nucleus., 
Depending on the dissociation rate constant for 
specific polypeptide ligands, these ligands largely 
reside after uptake in either the cytoplasm or the 
nucleus. 

Using the randomized starting material of 
Example 3, SPERT is directed toward the glucocorticoid 
receptor, either with indirect immunoprecipitation or 
affinity chromatography using bound receptor. As in 
prior example, SPERT protocols are manipulated so that 
polypeptides are found that compete directly for the 
glucocorticoid binding domain but that have much lower 
affinity than that observed for steroid hormones. As 
the polypeptide ligands evolve, screening of potential 
ligands is performed on individual candidates; thus 
resistance to proteolysis of the polypeptide ligand is 
tested using whole cell entry prior to the protease 
challenge, and testing both cells with and without an 
abundance of the glucocorticoid receptor. Polypeptide 
ligands that enter cells are localized in the cytoplasm 
or nucleus by means available to those skilled in the 
art. Those polypeptide ligands that enter cells with 
proper localization are fused to other polypeptide 
ligands to provide cell entry for molecules with other 
useful activities. 

Example 9. Polypeptide ligands toward nucleic acids: 
Inhibitors of transcription. 

Cancer cells can result from the over- 
expression of a transcriptional activator protein that 
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functions to • enhance transcription and .subsequent 
expression of sets of genes that push the cell toward 
inappropriate and uncontrolled growth. Thus, nutations 
that elevate the activity of a transcriptional enhancer 
raa y cause cancer through enhancement of the expression 
-of a set of genes relevant for growth control. Such 
tumors are treatable with polypeptide ligands that 
reset the appropriate level of expression or activity 
of the transcriptional enhancer, while it is likely 
that polypeptide ligands may be aimed at the enhancer 
protein directly, thus inhibiting the activity and 
resetting a proper growth rate, in the present example 
a polypeptide ligand is aimed at the production rate of 
the transcriptional enhancer. 

The polypeptide ligand of interest binds to the 
genome of the cancer cell at a location that competes 
for transcription of the gene that encodes the 
transcriptional activator protein, and hence expression 
of that protein. That is, in classical genetic 
language, the polypeptide ligand is a specific 
transcriptional repressor. 

The starting materials of Example 3 are used to 
generate a mixed pool of candidate polypeptides. A 
specific sequence of double-stranded DNA is prepared by 
chemical means and covalently attached to an insoluble 
column matrix. The column matrix is chosen such that 
ribosome complexes in general are able to. flow through 
the column containing bound DNA. Ribosome complexes 
containing nascent polypeptide ligands that interact 
with double-stranded DNA (either with sequence 
specificity or not) are retarded on the column, 
recovered, and placed into the SPERT protocol of mRNA- 
amplification, transcription, and a second cycle. In 
order to eliminate polypeptide ligands with affinity 
for all double-stranded DNA (that is, without adequate 
sequence specificity for the intended use) , the 
ribosome complexes are mixed with random soluble 
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double-stranded DNA sequences • prior to the^column 
partitioning step. The. soluble .DNA concentration is 
adjusted to give about tenfold more non-specific DNA 
during the partitioning step than is the abundance of 
specific DNA 'sequences attached to the column. In this 
banner polypeptide : l'igands d that v .are' indifferent to DNA 
sequence emerge from the column along with ribosome 
complexes containing polypeptide ligands that are 
unable to bind DNA at all. 

Polypeptide ligands aimed at a specific DNA 
sequence are characterized further. Randomized DNA 
sequences are used to establish which nucleotide pairs 
in the covalently attached DNA are required for avid 
binding of the polypeptide (using the SELEX protocol 
described in U.S. Patent Serial No. 07/536,428). A 
second SPERT is directed toward the contiguous DNA base 
pairs that are not bound by the first isolated 
polypeptide ligand, and the genes for the first and 
second polypeptide ligands are combined to yield a 
polypeptide ligand fusion (in either order, and 
containing a flexible peptide linker) to provide a 
polypeptide ligand with higher specificity and avidity 
than is available from either polypeptide ligand by 
itself. This improvement in specificity and avidity is 
an example of walking, although in this case the 
"steps" are made independently and the polypeptide 
ligands joined post-identification. 

The sequence of double-stranded DNA chosen in 
this example must overlap a transcriptional initiation 
signal. The ras oncogene transcriptional initiation 
region is chosen first. 

Example 10. Human c-myc protein epitope. 

This experiment shows that it is feasible to 
select an epitope or epitopes from a random mixture of 

RNA-encoded peptides. An antibody was chosen which 
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"loTsi ! IP..). - egression system may beset up 

„ In™, SPERT experiments utilizing a T7 
f or conducting SPERT, expe _ ,^ UTR) , containing 

promoter,'^ 5- untranslated region, (5 u 
lianals for either eukaryotic or prokaryotic 
signax&» . • _ cites for random or 

translation^ initiation, insertion sites for 
n on-ranao* seances which would encode "ascent 
""tides accessible to section on ribosoes and a 
fixed translated sequence (3 '-FTR) which ««-*- 
fixed w b uried in the translating 

peptide -^—^^ b ^ he T7 promote r sequence 
rib osome. Refer to Table 3. T ^ T > pcR with 

waS added to the eukaryotic 5' UTR thro g 
oli gos 1 and 2 fro, Table 3 using pl—ld P«« 
i. described by Siegel and Walter, (S4L 52 3 49, 

^o« R i The 3--FTR was obtained by PCR of the same 
1988). The P 3> These two 

plasmid using oligos 9 ana iu 

* e s'-UTR and 3'-FTR were cut with Nhel and 
fragments, 5 and = purified and further 

, • The lioated fragment was purm-cu 

ligated. The ugat y „ and BamHI sit es 

PCRd prior to cloning into the Hindiii 
pcro pti ci-T-ateaene Systems, Inc.) to 

of pBSSK+ (purchased from Strategene ay 
create the plasmid pPSX-EUK. The prokaryotic 5 '-UTR 
lie cloned using oligos 3 and 4 from Table 3 into 
In HindXII and Nhe X site of pPSX-EUK to create P sx 

PROK replacing the eurkaryotic ribosome binding site 
1 Hrokarytic one. The myc epitope encoding insert 

with a proKaryr. .^late oligo 7 with the 

is obtained by PCRing the template oiig 

oLLs 5 and 6, all from Table 3, and the variable 

in e* (for eight amino acids) is obtained by PCRing 
insert (ior y ^ g frQm 

the template oligo 8 with ^ o g ^ ^ 

Table 3. These inserts will be dig 

EcoRI and ligated in the presence of likewise tug 

P P X-EUK and pPSX-PROK. (This was done for the myc 

I'sert in pPSX-EUK) . Thus there will be a positive 

epitope-ncoding expression system which can be 
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translated by eukaryotic translation systems and 
separately by . prokaryotic translation systems, and 
variable nascent peptide-encoding system which can be 
likewise variably translated, and a system with no 
inserts which; can serve as an internal control for 
comparing extents, of .^ic^nts^by selection of 
polysomes by the anti-myc antibody. Further testing 
will identify what 3' ends will give the stablest 
polysome complexes; this may be accomplished by using 
oligos 10- in PCR (with oligo 1) to create multiple 
histidine codons for translation with no added 
histidine, with oligo 11 for normal unstopped 
translation with no amino acid depletion, and to test 
the extent of translation using oligo 12 which puts two 
stop codons allowing repeated translation of individual 
mRNAs. 
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method for naXing • **ai#ito of a 

target molecule comprising: 

a) synthesizing a translatable mRNA mixture, 
comprising a ribosome binding site, 
translation ; initiat^n* codow aiid % 
randomized sequence coding region? 

b) synthesizing a mixture of ribosome 
complexes, each member thereof comprisxng 
a ribosome, a nascent polypeptide and a 
translated mRNA, said mRNA having a 
randomized coding region and said nascent 
polypeptide being the translation product 

of said mRNA; 

c) partitioning the ribosome complexes with 
respect to binding of the ribosome 
complexes to a desired target molecule, 
thereby separating 1 the ribosome complexes 
into ribosome complex-target pairs and 
unbound complexes, the ribosome complex- 
target pairs having mRNA enriched for 
sequences encoding target-binding 
polypeptides ; 

d) amplifying %he mRNA of partitioned 
ribosome complex-target pairs to yield a 
translatable mRNA mixture comprising a 
ribosome binding site, an initiation codon 
and a coding region enriched for sequences 
encoding target-binding polypeptides; 

e) repeating steps b) through d) using the 
mRNA enriched for sequences encoding 
target-binding polypeptides of each 
successive repeat as many times as desired 
to yield a desired level of target binding 
by a polypeptide encoded by the mRNA 
enriched for sequences encoding the 
polypeptide; and 
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f) synthesizing a ' polypeptide encoded by the 
enriched mRNA of -step e) , thereby making a 
polypeptide ligand of a target molecule. 



10 



15 



20 



25 



The method for selecting a polypeptide ligand 
of a desired target molecule from a polypeptide 
mixture comprising: 

a) synthesizing a polypeptide mixture each 
member thereof having attached thereto 
amplifying means for separately amplifying 
the individual polypeptide to which it is 
attached ; 

b) partitioning the polypeptide mixture with 
respect to binding the target molecule, 
thereby separating the mixture into 
polypeptide-target pairs and unbound 
polypeptides ; 

c) amplifying the polypeptides of 
polypeptide-target pairs using said 
amplifying means; and 

d) repeating the partitioning and amplifying 
steps to select a polypeptide ligand of a 
desired target molecule. 

The method of claim 2 wherein the polypeptide 
mixture comprises polypeptides having a segment 
of randomized amino acid sequence. 



4. The method of claim 3 wherein the segment of 

30 randomized amino acid sequence is from 4 to 50 

amino acids in length. 



5. The method of claim 3 wherein the amplifying 

means comprises an mRNA mixture, each member 
35 thereof encoding a polypeptide of the 

polypeptide mixture and being attached to the 
polypeptide it encodes as part of a ribosome 
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The method of claim 3 wherein the step of 
amplifying the polypeptides comprises the 
additional step of amplifying the mRNA mixture. 

The method of claim 6 wherein the mRNA mixture 
is amplified by reverse transcription and a 
polymerase chain reaction. 

A method for making a polypeptide ligand of a 
target molecule comprising: 

(a) synthesizing a mRNA mixture comprising 
translatable and nontranslatable regions, 
wherein said translatable region comprises 
randomized and fixed sequence coding 
regions ; 

(b) synthesizing a mixture of mRNA • polypeptide 
copolymers, each member comprising an mRNA 
and a polypeptide encoded by its 
associated mRNA, wherein a portion of said 
nontranslatable region of said mRNA and a 
portion of said polypeptide encoded by 
said fixed sequence coding region form a 
binding interaction; 

(c) partitioning the mRNA- polypeptide 
copolymers with respect to affinity of the 
copolymers to a desired target molecule; 

(d) amplifying the mRNA of partitioned 
copolymers to yield a translatable mRNA 

mixture? and 

(e) synthesizing a polypeptide or polypeptides 
encoded by the mRNA mixture of step (d) . 

The method of claim 8 further comprising the 
steps of repeating steps (a) through <d) using 
the mRNA mixture of step (d) in successive 
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'cycles repeating as many times as desired to 
yield copolymers with, the desired affinity to 
the target. 



5 10. The method of claim 8 wherein the target 
molecule is a protein. 



10 



11. The method of claim 10 wherein the protein is 
an enzyme. 

12. The method of claim 10 wherein the protein is 
an antibody. 



13. The method of claim 10 wherein the protein is a 
15 receptor. 

14. The method of claim 10 wherein the protein is a 
nucleic acid binding protein. 

20 15. The method of claim 10 wherein the protein is a 
toxin. 



25 



16. The method of claim 10 wherein the protein is a 
glycoprotein. 

17. The method of claim 10 wherein the protein is 
an antigen. 



18. The method of claim 8 wherein the polypeptide 
30 is an inhibitor of function of the target 

molecule. 



35 



19. 



20, 



The method of claim 8 wherein the target 
molecule is a cell membrane component. 

The method of claim 8 wherein the target 
molecule is a virus component. 
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Wmethod of claim BWhereimthe target 
molecule is a carbohydrate. 

The method of claim 8 wherein the target • 
molecule is ,a polysaccharide * 

r « .it. ■- ' 
The method of claim 8 wherein the target 
molecule is a lipid. 

The method of claim 8 wherein the target 
molecule is a glycolipid. 



25. The method of claim 8 wherein the target 
molecule is a toxin. 



The method of claim 8 wherein the target 
molecule- is a drug. - 



27. The method of claim 8 wherein the target . 
molecule is a controlled substance. 



The method of claim 8 wherein the target 
molecule is a metabolite. 

The method of claim 8 wherein the target 
molecule is a cof actor. 

The method of claim 8 wherein the target 
molecule is a nucleic acid. 

The method of claim 8 wherein the target 
molecule is a hormone. 

The method of claim 8 wherein the target 
molecule is a receptor ligand. 

The method of claim 8 wherein the target 
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uii r *v t > . <:i molecule* is a .transition state analog, 

34. The method of claim 8 wherein the partitioning 
is carried out by column chromatography. , 

5 

35. The method .of* claim 8> wherein, the partitioning 
i is carried out by binding to target molecules 

attached to a solid phase matrix. 

10 36. The method of claim 8 wherein the partitioning 
is carried out by immunoprecipitation. 

37. The method of claim 8 wherein the partitioning 
is carried out by indirect immunoprecipitation. 

15 

38. The method of claim 8 wherein the mRNA is 
amplified in step d) by polymerase chain 
reaction, 

20 39. The method of claim 8 wherein the process of 

amplifying in step d) includes introducing 
mutations during amplification. 

40. The method of claim 8 wherein step f) is 
25 carried out by chemical synthesis of the 

polypeptide ligand. 

41. The method of claim 8 wherein the mRNA 
additionally comprises a sequence encoding a 

30 segment of polypeptide that functions to bind a 

bridging molecule and step c) further comprises 
binding target molecules to a solid phase 
matrix and binding to the target molecules an 
anchor molecule covalently bound to the 

35 bridging molecule, the anchor molecule being 

capable of specifically binding the target 
molecules whereby mRNA •polypeptide copolymers 
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'.-» Vind to the bridging' molecule anchored to. the 
target molecules. 
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The method of claim 8 comprising, the additional 
steps of synthesizing a second translatable 
mRNA' mixture comprising -the mRNA selected by 
steps a) - e) and a .second randomized sequence 
coding region, and repeating steps b) - e) 
using the second translatable mRNA mixture to 
yield a desired level of target binding by a 
polypeptide encoded by the second mRNA enriched 
for sequences encoding the polypeptide. 

A mixture of mRNA • polypeptide copolymers 

comprising: 

an mRNA comprising nontranslatable 
portions and translatable portions; 

a polypeptide encoded by said mRNA 
comprising random and fixed sequence regions, 
wherein said mRNA and polypeptide are bound 
together by at least a portion of the 
nontranslatable portion of said mRNA and at 
least a portion of the fixed sequence region of 
said polypeptide. 

A polypeptide that is a ligand of a target 
molecule prepared according to the method 
described in claim 8. 

A method for making a polypeptide ligand of a 
target molecule comprising: 

(a) synthesizing a mRNA mixture of at least 
10 14 sequences comprising translatable and 
nontranslatable regions; 

(b ) synthesizing a mixture of mRNA- polypeptide 
copolymers, each member comprising an mRNA 
and a polypeptide encoded by its 
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associated mRNA,. and not containing a 
ribosoroe; 

(c) partitioning the mRNA' polypeptide 
copolymers with respect to affinity of the 
copolymers to a desired target molecule; 

(d) amplifying the mRNA of partitioned 
copolymers to yield a translatable mRNA 
mixture; and 

(e) synthesizing a polypeptide or polypeptides 
encoded by the mRNA mixture of step (d) . 

The method of claim 45 wherein said 
mRNA 'polypeptide copolymers are synthesized by 
the post-translational or co-translational 
interaction between a portion of the 
nontranslatable portion of said mRNA and a 
portion of said polypeptide. 

The method of claim 45 wherein said 
mRNA 'polypeptide copolymers are synthesized by 
crossl inking the polypeptide-tRNA-mRNA complex 
after translation of the mRNA. 



48. The method of claim 45 wherein said 

mRNA 'polypeptide copolymers are synthesized by 
linking the 5' nucleic acid sequence of the 
mRNA to the initial amino acid sequences of the 
polypeptide prior to translation. 
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